Robuta

https://imbue.com/research/70b-evals/
This is the first of a three-part series on how we trained our 70B model. We covered setting up infrastructure, conducting evaluations, and…
open source, natural language, code understanding, sanitized, datasets
https://agenthunt.io/agent/detail/huggingface-co/
Discover Hugging Face, the ultimate AI platform for machine learning innovation. Access thousands of pre-trained models, collaborate with a global community,...
open source ai, hugging face, models, datasets, amp
https://www.tensorflow.org/datasets/catalog/waymo_open_dataset?authuser=1&hl=fa
open dataset, tensorflow datasets, waymo
https://www.kaggle.com/datasets?license=cc
Download Open Datasets on 1000s of Projects + Share Projects on One Platform. Explore Popular Topics Like Government, Sports, Medicine, Fintech, Food, More....
machine learning projects, open datasets, find, kaggle
https://www.stlouis-mo.gov/data/departments/department.cfm?id=93
All open datasets for a given department
open data, health, hospitals, director, office
https://www.kaggle.com/datasets?search=Inventory
Download Open Datasets on 1000s of Projects + Share Projects on One Platform. Explore Popular Topics Like Government, Sports, Medicine, Fintech, Food, More....
machine learning projects, open datasets, find, kaggle
https://aws.amazon.com/about-aws/whats-new/2021/10/new-datasets-available-on-the-registry-of-open-data/
Discover more about what's new at AWS with New datasets available on the Registry of Open Data from University of Sydney, International Brain Laboratory,...
open data, new, datasets, available, registry
https://www.stlouis-mo.gov/data/departments/department.cfm?id=30
All open datasets for a given department
open data, board, aldermen, datasets
https://mutace-papousku.com/
mutace, bioinformatics, blog, avian, genomes
https://deepai.org/publication/the-asru-2019-mandarin-english-code-switching-speech-recognition-challenge-open-datasets-tracks-methods-and-results
07/12/20 - Code-switching (CS) is a common phenomenon and recognizing CS speech is challenging. But CS speech data is scarce and there's no ...
code switching, speech recognition, asru, mandarin, english
https://www.tensorflow.org/datasets/catalog/open_images_v4?hl=ja
open images, tensorflow datasets
https://huggingface.co/papers/2407.10953
Join the discussion on this paper page
paper, mmm, multilingual, mutual, reinforcement
https://www.silicon.fr/data-ia-1372/open-r1-deepseek-224586
Nov 27, 2025 - After the phase focused on datasets, the project, which aims at an open reproduction of DeepSeek-R1, has moved on to the training pipeline.
les, datasets, open, cherche, pipeline
https://deepai.org/publication/the-accented-english-speech-recognition-challenge-2020-open-datasets-tracks-baselines-results-and-methods
02/20/21 - The variety of accents has posed a big challenge to speech recognition. The Accented English Speech Recognition Challenge (AESRC20...
english speech, open datasets, accented, recognition, challenge
https://www.kaggle.com/datasets
Download Open Datasets on 1000s of Projects + Share Projects on One Platform. Explore Popular Topics Like Government, Sports, Medicine, Fintech, Food, More....
machine learning projects, open datasets, find, kaggle
https://hackernoon.com/visualizing-promptable-and-open-vocabulary-segmentation-across-multiple-datasets
Explore a collection of visualizations demonstrating the effectiveness of promptable and open-vocabulary segmentation across various datasets.
visualizing, open, vocabulary, segmentation, across
https://www.tensorflow.org/datasets/catalog/open_images_challenge2019_detection?authuser=1&hl=tr
open images, tensorflow datasets, detection
https://www.stlouis-mo.gov/data/datasets/index.cfm
Browse all public registered City of St. Louis datasets
open datasets
https://www.stlouis-mo.gov/data/formats/format.cfm?id=17
All datasets with distributions with a given format
open data, spreadsheet, datasets
https://www.tensorflow.org/datasets/catalog/open_images_v4?authuser=0&hl=he
open images, tensorflow datasets
https://www.tensorflow.org/datasets/catalog/natural_questions_open?hl=id
natural questions, tensorflow datasets, open
https://www.analyticsvidhya.com/blog/2021/05/top-10-open-source-datasets-for-object-detection-machine-learning-in-2021/?utm_source=reading_list&utm_medium=https://www.analyticsvidhya.com/blog/2019/01/build-image-classification-model-10-minutes/
The article comprises ten open-source datasets for object detection in machine learning in 2021 with the respective sources.
open source, object detection, datasets
https://aclanthology.org/2025.coling-main.725/
Yuming Yang, Wantong Zhao, Caishuang Huang, Junjie Ye, Xiao Wang, Huiyuan Zheng, Yang Nan, Yuran Wang, Xueying Xu, Kaixin Huang, Yunke Zhang, Tao Gui, Qi...
beyond boundaries, learning, universal, entity, taxonomy
https://www.kaggle.com/datasets?search=image+classification
Download Open Datasets on 1000s of Projects + Share Projects on One Platform. Explore Popular Topics Like Government, Sports, Medicine, Fintech, Food, More....
machine learning projectsopen datasetsfindkaggle
https://zenodo.org/communities/gmap/records?q=&f=resource_type%3Apresentation&l=list&p=1&s=10&sort=newest
search open, datasets, code, papers, gmap
https://www.tensorflow.org/datasets/catalog/open_images_challenge2019_detection?authuser=2&hl=vi
open images, tensorflow datasets, detection
https://www.kaggle.com/datasets?sortBy=relevance&group=public&search=iris&page=1&pageSize=20&size=all&filetype=all&license=all
Download Open Datasets on 1000s of Projects + Share Projects on One Platform. Explore Popular Topics Like Government, Sports, Medicine, Fintech, Food, More....
machine learning projects, open datasets, find, kaggle