https://openreview.net/forum?id=N3WOfE4bvw&referrer=%5Bthe%20profile%20of%20Carlo%20Luschi%5D(%2Fprofile%3Fid%3D~Carlo_Luschi1)
Retrieval of information from graph-structured knowledge bases represents a promising direction for improving the factuality of LLMs. While various solutions...
ground truthsubgraphsbettertrainingevaluation
https://www.nvidia.com/en-us/on-demand/session/aisummitdc24-sdc1061/
Advancements in Large Language Models (LLMs) have enabled developers to create a variety of applications such as code generation, translation, and text sum
data processingtraining llmsscalesummit
https://openreview.net/forum?id=ElgAzv9fNk&referrer=%5Bthe%20profile%20of%20Dan%20Alistarh%5D(%2Fprofile%3Fid%3D~Dan_Alistarh7)
One main approach to reducing the massive costs of large language models (LLMs) is the use of quantized or sparse representations for training or deployment....
training accuratequestllmshighlycompressed
https://www.sonarsource.com/de/products/sonarsweep/
Sonarweep validates quality coding datasets, deduplicating, fixing issues & filtering noise to help your LLM train efficiently and produce more secure,...
ensure qualitytraining datacodingllmssonar
https://aclanthology.org/2024.inlg-main.36/
Shivprasad Rajendra Sagare, Hemachandran S, Kinshuk Sarabhai, Prashant Ullegaddi, Rajeshkumar Sa. Proceedings of the 17th International Natural Language...
audio visualvideo texttrainingimprovedgrounding
https://aclanthology.org/2024.finnlp-1.33/
Jens Van Nooten, Andriy Kosar. Proceedings of the Joint Workshop of the 7th Financial Technology and Natural Language Processing, the 5th Knowledge Discovery...
csr themeadvancingtopicclassificationllms
https://openreview.net/forum?id=zmDXUAmjRn&referrer=%5Bthe%20profile%20of%20Dawen%20Liang%5D(%2Fprofile%3Fid%3D~Dawen_Liang1)
We propose **DiFF**PO, **Di**ffusion **F**ast and **F**urious Policy Optimization, a unified framework for training masked diffusion large language models...
trainingdiffusionllmsreasonfast
https://www.deeplearning.ai/courses/fine-tuning-and-reinforcement-learning-for-llms-intro-to-post-training/
Apply fine-tuning and reinforcement learning techniques to shape model behavior, improve reasoning, and make LLMs safer and more reliable.
fine tuningreinforcement learningllmsintropost
https://openreview.net/forum?id=ZDpPfg9pDc&referrer=%5Bthe%20profile%20of%20Kai-Chiang%20Wu%5D(%2Fprofile%3Fid%3D~Kai-Chiang_Wu1)
The immense model sizes of large language models (LLMs) challenge deployment on memory-limited consumer GPUs. Although model compression and parameter...
speculatedeepaccuratelosslesstraining