Skip to main content

CSV

Twitter is one of the most popular social networks for sentiment analysis. This data set of tweets are related to the stock market. We collected 943,672 tweets between April 9 and July 16, 2020, using the S&P 500 tag (#SPX500), the references to the top 25 companies in the S&P 500 index, and the Bloomberg tag (#stocks). 1,300 out of the 943,672 tweets were manually annotated in positive, neutral, or negative classes. A second independent annotator reviewed the manually annotated tweets.

Categories:

This dataset has been developed based on the work of the GeoCOV19Tweets Dataset. The original work by Lamsal, R. runs network analysis on a similar dataset to understand the underlying relationship between countries and hashtags. The work did an analysis on roughly 300k number of [country, hashtag] relations from 190 countries and territories, and 5055 unique hashtags. This work pushes the number of relationships by 3 times.

Categories:

LiDAR point cloud data serves as an machine vision alternative other than image. Its advantages when compared to image and video includes depth estimation and distance measruement. Low-density LiDAR point cloud data can be used to achieve navigation, obstacle detection and obstacle avoidance for mobile robots. autonomous vehicle and drones. In this metadata, we scanned over 1200 objects and classified it into 4 groups of object namely, human, cars, motorcyclist.

Categories:

Automatic humor detection has interesting use cases in modern technologies, such as chatbots and virtual assistants. Existing humor detection datasets usually combined formal non-humorous texts and informal jokes with incompatible statistics (text length, words count, etc.). This makes it more likely to detect humor with simple analytical models and without understanding the underlying latent lingual features and structures.

Categories:

This is the dataset provided and collected while "Car Hacking: Attack & Defense Challenge" in 2020. We are the main organizer of the competition along with Culture Makers and Korea Internet & Security Agency. We are very proud of releasing these valuable datasets for all security researchers for free.

The competition aimed to develop attack and detection techniques of Controller Area Network (CAN), a widely used standard of in-vehicle network. The target vehicle of competition was Hyundai Avante CN7.

Categories:
We disclose a traffic landmark dataset for detection.The dataset generated with our framework includes about 150,000 images and annotations of about 470,000 traffic landmarks.Our dataset was collected in an urban area of Seoul and suburban areas of Suwon, Hwaseong, Yongin, and Seongnam in South Korea at different times of the day.Images taken in the morning or evening included a large number of saturated areas due to exposure to direct sunlight.Most images taken under the light condition of the late evening was low-contrast.The images taken at noon included the reflection of the windshield due
Categories: