Multilingual, multimodal, and updated frequently.
We're not only releasing 'one kind of data.'
The public list spans speech, text, vision/video, and even medical imaging.
Call-center audio + transcripts
Multilingual short utterances
Accented English (incl. AAVE)
PII detection pack
MRI DICOM (1.5T)
Video: damaged cars + human activities