https://winbuzzer.com/2025/02/01/mlcommons-and-hugging-face-launch-huge-speech-dataset-with-more-than-a-million-hours-of-audio-xcxwbn/
MLCommons And Hugging Face Launch Huge Speech Dataset With More Than A Million Hours Of Audio -...
Feb 1, 2025 - An extensive multilingual speech dataset from MLCommons and Hugging Face offers over one million hours of audio, setting a new standard for AI-driven speech...
hugging facespeech dataset
Sponsored https://www.fanvue.com/isla-king
Isla King - Fanvue
Hi I'm Isla! After way too much overthinking (and a million should I really do this moments), I finally took the leap. I'm just a girl who's never...
https://mlcommons.org/datasets/unsupervised-peoples-speech/
People's Speech Dataset | MLCommons Datasets
Mar 4, 2025 - The MLCommons People’s Speech Dataset contains 30,000 hours of conversational English speech recognition licensed for academic and commercial machine...
speech dataset mlcommons
https://mlcommons.org/2025/04/ailuminate-french-datasets/
MLCommons Releases French AILuminate Benchmark Demo Prompt Dataset to Github - MLCommons
Apr 16, 2025 - MLCommons announces the release of two French language datasets for the AILuminate benchmark. A 1,200 prompt Creative-Commons licensed version, and 12,000...
mlcommons releasesfrenchdemo
https://mlcommons.org/datasets/peoples-speech/
People's Speech Dataset | MLCommons Datasets
Nov 20, 2024 - The People’s Speech Dataset contains 30,000 hours of conversational English speech recognition licensed for academic and commercial machine learning usage.
speech dataset mlcommons
https://mlcommons.org/datasets/dollar-street/
Dollar Street Dataset | MLCommons Machine Learning Datasets
Nov 20, 2024 - The MLCommons Dollar Street Dataset is a collection of images of everyday household items from homes around the world for machine learning.
dataset mlcommonsdollarstreet
https://mlcommons.org/datasets/cognata/
Cognata Dataset | MLCommons Machine Learning Datasets
Nov 20, 2024 - The MLCommons Cognata Dataset is a set of photorealistic synthetic automotive data frames of urban and highway scenarios to train machine learning (ML) for...
dataset mlcommonscognata
https://mlcommons.org/datasets/multilingual-spoken-words/
Multilingual Spoken Words Dataset | MLCommons Datasets
Nov 20, 2024 - MLCommons Multilingual Spoken Words Dataset Corpus is a large and growing audio dataset of spoken words in 50 languages for machine learning.
dataset mlcommons datasets