UCI and OpenML Data Sets for Ordinal Quantification
These four labeled data sets are targeted at ordinal quantification. The goal of quantification is not to predict the label of each individual instance, but to estimate the distribution of labels in unlabeled sets of data.
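To make the distinction concrete, the sketch below shows "classify and count", a simple baseline quantifier: it turns per-instance predictions into an estimate of the label distribution. It is an illustration only, not the method shipped with these data sets; the function name `classify_and_count`, the five ordered classes, and the example predictions are all hypothetical.

```python
from collections import Counter


def classify_and_count(predicted_labels, classes):
    """Estimate class prevalences as the share of predictions per class.

    `classes` lists the labels in their ordinal order, so the returned
    list keeps that order (for example, 1 star up to 5 stars).
    """
    total = len(predicted_labels)
    if total == 0:
        raise ValueError("cannot estimate prevalences from an empty sample")
    counts = Counter(predicted_labels)
    # Classes that never occur get a prevalence of 0.0.
    return [counts[c] / total for c in classes]


# Hypothetical ordinal classes and classifier outputs for one unlabeled sample.
classes = [1, 2, 3, 4, 5]
predictions = [1, 2, 2, 3, 3, 3, 4, 5, 5, 5]

prevalences = classify_and_count(predictions, classes)
print(prevalences)
```

The quantity of interest is the whole list of prevalences for the sample, not any single prediction. Because this baseline inherits the bias of the underlying classifier, it is usually treated as a starting point rather than a final estimate.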
Twitter Discussions Topics dataset
Dataset of tweet IDs and timestamps in CSV format, organized by the COVID-19-related discussion topics the tweets belong to.
FaVCI2D - Face verification dataset focused on demographic diversity and difficult imposters
FaVCI2D tackles problematic design choices of existing face verification datasets: (1) imposter pairs are too easy, (2) the demographic diversity is insufficient, and (3) there is disregard for ethical and legal aspects.
Bus Violence: a large-scale benchmark for video violence detection in public transport
The Bus Violence dataset is a large-scale collection of videos depicting violent and non-violent situations in public transport environments.
The ImageCLEFAware 2021 Dataset
Images constitute a large part of the content shared on social networks.
100-Driver: A Large-scale, Diverse Dataset for Distracted Driver Classification
A large-scale, diverse posture-based distracted driver dataset, with more than 470K images taken by 4 cameras observing 100 drivers over 79 hours from 5 vehicles.