https://www.anthropic.com/research/assistant-axis
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
assistantaxisstabilizingcharacterlarge
https://www.coursera.org/courses?query=large+language+models&page=3
Large Language Models courses from top universities and industry leaders. Learn Large Language Models online with courses like Sequence Models and LangChain...
large language modelscourses learntoponline
https://deepai.org/publication/code-prompting-a-neural-symbolic-method-for-complex-reasoning-in-large-language-models
05/29/23 - Large language models (LLMs) have scaled up to unlock a wide range of complex reasoning tasks with the aid of various prompting me...
symbolic methodcodepromptingneuralcomplex
https://www.timeshighereducation.com/research/qassim-university/using-large-language-models-process-arabic-text
For all of the controversy surrounding large language models, these tools present science with the chance to transform lives for the better. At Qassim...
large language modelsarabic textusingprocesstimes
https://openreview.net/forum?id=Le9anH3kv1&referrer=%5Bthe%20profile%20of%20Sam%20Havens%5D(%2Fprofile%3Fid%3D~Sam_Havens1)
Retrieval Augmented Generation (RAG) has emerged as a crucial technique for enhancing the accuracy of Large Language Models (LLMs) by incorporating external...
large language modelslong contextragperformanceopenreview
https://www.cncf.io/blog/2026/02/04/conversing-with-large-language-models-using-dapr/
Imagine you are running a bunch of microservices, each living within its own boundary. What are some of the challenges that come into mind when operating them?
large language modelsconversingusingdaprcncf
https://arxiv.org/abs/2509.19803?utm_source=www.turingpost.com&utm_medium=referral&utm_campaign=fod-120-grpo-why-is-everybody-talking-about-it-this-weekend
Abstract page for arXiv paper 2509.19803: VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models
reinforcement learningvariancebasedcurriculumlarge
https://www.nvidia.com/en-gb/glossary/large-language-models/
Large language models (LLMs) are deep learning algorithms that can recognize, summarize, translate, predict, and generate content using very large datasets.
large language modelsnvidiaglossary
https://aclanthology.org/2024.findings-acl.582/
Bowen Shen, Zheng Lin, Daren Zha, Wei Liu, Jian Luan, Bin Wang, Weiping Wang. Findings of the Association for Computational Linguistics: ACL 2024. 2024.
large language modelspruningintramodulelow
https://theamericangenius.com/large-language-models/
May 4, 2025 - (TECHNOLOGY) Large language models guide our AI training and recently, ethicists have pointed out serious flaws in LLMs (which cost some their jobs).
large language modelsllmsbecome
https://openreview.net/forum?id=WkpqUVcSTy&referrer=%5Bthe%20profile%20of%20Hong-You%20Chen%5D(%2Fprofile%3Fid%3D~Hong-You_Chen1)
We propose SlowFast-LLaVA (or SF-LLaVA for short), a training-free video large language model (LLM) that can jointly capture the detailed spatial semantics and...
strongtrainingfreebaselinevideo
https://openreview.net/forum?id=EfTuzTijDo&referrer=%5Bthe%20profile%20of%20Tuo%20Zhao%5D(%2Fprofile%3Fid%3D~Tuo_Zhao2)
Large language models (LLMs) exhibit remarkable performance across various natural language processing tasks but suffer from immense computational and memory...
unified frameworkshapepreservingcompression
https://openreview.net/forum?id=G9qA1JZ0Sy&referrer=%5Bthe%20profile%20of%20Jingyang%20Qiao%5D(%2Fprofile%3Fid%3D~Jingyang_Qiao1)
Instruction tuning guides the Multimodal Large Language Models (MLLMs) in aligning different modalities by designing text instructions, which seems to be an...
large languagemultimodalcontinualassistantopenreview
https://arxiv.org/abs/2512.21859v1
Abstract page for arXiv paper 2512.21859v1: TimeBill: Time-Budgeted Inference for Large Language Models
large language modelstimeinference
https://openreview.net/forum?id=B4SFmNvBNz&referrer=%5Bthe%20profile%20of%20Amin%20Karbasi%5D(%2Fprofile%3Fid%3D~Amin_Karbasi3)
Is automated hallucination detection fundamentally possible? In this paper, we introduce a theoretical framework to rigorously study the (im)possibility of...
large language modelsimpossibilityautomatedhallucination
https://arxiv.org/abs/2310.06225?utm_source=patricks-newsletter-b6993b.beehiiv.com&utm_medium=referral&utm_campaign=leveraging-llms-in-agriculture-a-path-forward
Abstract page for arXiv paper 2310.06225: GPT-4 as an Agronomist Assistant? Answering Agriculture Exams Using Large Language Models
gptagronomistassistantansweringagriculture
https://devtalk.com/deepseek
DeepSeek portal on Devtalk - see what's trending, discuss, share news, articles, blog posts, libraries or ask questions about DeepSeek in our forum.
large language modeldeepseekportal
https://www.cochrane.org/ru/events/opportunities-and-challenges-data-extraction-large-language-model
Image Data extraction in evidence synthesis is labour-intensive, costly, and prone to errors. The use of large language models (LLMs) presents a promising...
data extractionlarge languageopportunitieschallenges
https://arxiv.org/abs/2407.12835
Abstract page for arXiv paper 2407.12835: Regurgitative Training: The Value of Real Data in Training Large Language Models
real datatrainingvalue
https://www.amazon.science/publications/a-preference-driven-paradigm-for-enhanced-translation-with-large-language-models
Recent research has shown that large language models (LLMs) can achieve remarkable translation performance through supervised fine tuning (SFT) using only a...
enhanced translationlarge languagepreferencedrivenparadigm
https://issue1.forestfriends.tech/
Chances are, you've been building your LLM app with "vibes-based engineering", where you look at the LLM outputs in dev and eyeball it with "Looks good to me"....
large language modelsystemevalswild
https://aclanthology.org/2024.acl-long.703/
Zhaochen Su, Juntao Li, Jun Zhang, Tong Zhu, Xiaoye Qu, Pan Zhou, Yan Bowen, Yu Cheng, Min Zhang. Proceedings of the 62nd Annual Meeting of the Association for...
large language modelslivingmomentgraspco
https://aclanthology.org/2024.findings-acl.703/
Linhao Yu, Yongqi Leng, Yufei Huang, Shang Wu, Haixin Liu, Xinmeng Ji, Jiahui Zhao, Jinwang Song, Tingting Cui, Xiaoqing Cheng, Tao Liu, Deyi Xiong. Findings...
large language modelsmoralevaluationbenchmarkchinese
https://www.arxiv.org/abs/2601.08833
Abstract page for arXiv paper 2601.08833: Revisiting Disaggregated Large Language Model Serving for Performance and Energy Implications
large language modelservingperformance
https://carleton.ca/slals/event/generative-ai-and-large-language-models/
Please join Don Myles, the Faculty Teaching Mentor for the School of Linguistics and Language Studies, for an informal brown bag session on the impact of...
large language modelsbrown baggenerative aisessionschool
https://www.wired.it/article/large-language-model-benchmark-test-misurazioni-modelli/
Jul 12, 2025 - Pro e contro dei benchmark, gli strumenti con cui si cerca di valutare ciò che sta diventando sempre più difficile da valutare: il livello raggiunto dai...
large language modelbenchmarkcomefacciamoquanto
https://www.datacamp.com/code-along/using-large-language-models-with-the-cohere-api
In this session, you'll learn to use the Cohere API with Python to generate content based on a given prompt, extract information from documents, and build a...
large language modelsusingcohereapidatacamp
https://openreview.net/forum?id=HcyVr9SlwR&referrer=%5Bthe%20profile%20of%20Hongyuan%20Lu%5D(%2Fprofile%3Fid%3D~Hongyuan_Lu2)
Data contamination gradually becomes inevitable during the development of large language models (LLMs), meaning the training data commonly integrates those...
lbglnebasedblockinggeneration
https://observablehq.com/@sorami/sizes-of-large-language-models
Inspired by the figure in the DistilBERT paper. (Sanh et al., 2019) DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
large language modelssizessoramihisamotoobservable
https://explodingtopics.com/blog/list-of-llms
A regularly updated list of LLMs disrupting the artificial intelligence space.
large language modelsbestllms
https://www.ibm.com/think/topics/large-language-models
Large language models are AI systems capable of understanding and generating human language by processing vast amounts of text data.
large language modelsllmsibm
https://www.amazon.science/publications/contextual-asr-with-retrieval-augmented-large-language-model
Automatic speech recognition (ASR) systems can benefit from incorporating contextual information to improve recognition accuracy, especially for uncommon words...
large language modelamazon sciencecontextualasrretrieval
https://arxiv.org/abs/2310.03214?utm_source=www.therundown.ai&utm_medium=referral&utm_campaign=gpt-4-vision-s-newest-competitor-is-free
Abstract page for arXiv paper 2310.03214: FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation
large language modelssearch enginerefreshing
https://www.llama2.space/
Experience the power of Llama 2, the second-generation Large Language Model by Meta. Choose from three model sizes, pre-trained on 2 trillion tokens, and...
large language modelnext genllamaonlinemeta
https://www.sei.cmu.edu/blog/application-of-large-language-models-llms-in-software-engineering-overblown-hype-or-disruptive-change/
This blog post explores large language models (LLMs) in software development, implications of incorporating LLMs into software-reliant systems, and areas where...
large language modelssoftware engineeringapplicationllmsoverblown
https://aimmediahouse.com/conference-videos/ai-observability-the-key-to-unlocking-the-full-potential-of-large-language-models
Aug 14, 2024 - AI observability is essential to really unlock the full potential of these models.
ai observabilityfull potentialkeyunlocking
https://aclanthology.org/2024.emnlp-main.1070/
Chen Wang, Minpeng Liao, Zhongqiang Huang, Junhong Wu, Chengqing Zong, Jiajun Zhang. Proceedings of the 2024 Conference on Empirical Methods in Natural...
speech languageacl anthologyblspemotowards
https://www.cochrane.org/es/events/opportunities-and-challenges-data-extraction-large-language-model
Image Data extraction in evidence synthesis is labour-intensive, costly, and prone to errors. The use of large language models (LLMs) presents a promising...
data extractionlarge languageopportunitieschallenges
https://aclanthology.org/2024.acl-long.221/
Ning Bian, Xianpei Han, Hongyu Lin, Yaojie Lu, Ben He, Le Sun. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume...
rulestorybettercommonsenseexpression
https://www.rackspace.com/en-ae/blog/large-language-models-llms-vs-small-language-models-slms
This article explores the differences between large language models (LLMs) and small language models (SLMs), highlighting their advantages and applications....
large language modelsllmsvssmallrackspace
https://towardsdatascience.com/your-next-large-language-model-might-not-be-large-afterall-2/
Nov 28, 2025 - A 27M-parameter model just outperformed giants like DeepSeek R1, o3-mini, and Claude 3.7 on reasoning tasks
language modelnextmightlarge
https://localllm.in/
Your definitive resource for Local Large Language Models. Learn how to run AI models on your own hardware with our comprehensive guides, tutorials, and tools.
large language modelsguidelocal
https://blog.talosintelligence.com/cybercriminal-abuse-of-large-language-models/
Cybercriminals are increasingly gravitating towards uncensored LLMs, cybercriminal-designed LLMs and jailbreaking legitimate LLMs.
large language modelscybercriminalabuse
https://jailbreak-llms.xinyueshen.me/
Deformable Neural Radiance Fields creates free-viewpoint portraits (nerfies) from casually captured videos.
anythingevaluatingwildjailbreak
https://www.geeky-gadgets.com/minicpm-2b-llm/
MiniCPM 2B is a new large language model (LLM) which is performing extremely well even with only 2 billion parameters. Providing competition
large language modelsmallyetpowerfulllm
https://onehack.st/t/gorilla-large-language-model-connected-with-massive-apis/295149
Gorilla: Large Language Model Connected with Massive APIs [Project Website] :fire_engine: GoEx: A Runtime for executing LLM generated actions like code & API...
large language modelgorillaconnectedmassiveapis
https://research.google/blog/privacy-considerations-in-large-language-models/
Posted by Nicholas Carlini, Research Scientist, Google Research Machine learning-based language models trained to predict the next word in a senten...
large language modelsprivacy considerations
https://www.hostinger.com/au/tutorials/large-language-models
Large language models (LLMs) power today's AI systems, from chatbots to assistants. Read to learn what LLMs are and how they work.
large language modelsbest optionsexamples
https://www.databricks.com/blog/coreweave-nvidia-h100-part-1?itm_source=www&itm_category=home&itm_page=home&itm_component=general-asset-card&itm_offer=coreweave-nvidia-h100-part-1
The research and engineering teams here at MosaicML collaborated with CoreWeave, one of the leading cloud providers for NVIDIA GPU-accelerated server...
large language modelsbenchmarkingnvidiagpuscoreweave
https://www.alibabacloud.com/blog/streamlined-deployment-and-integration-of-large-language-models-with-pai-eas_600762
This article provides a comprehensive guide on deploying a large language model (LLM) application using the Platform for AI - Elastic Algorithm Service.
large language modelsstreamlineddeploymentintegrationpai
https://news.sophos.com/en-us/2024/03/18/benchmarking-the-security-capabilities-of-large-language-models/
Comparative Sophos X-Ops testing not only indicates which models fare best in cybersecurity, but where cybersecurity fares best in AI
large language modelssecurity capabilitiesbenchmarkingsophos
https://snorkel.ai/large-language-models/
Explore the evolution, strengths, and limitations of large language models in AI with Snorkel AI’s expert breakdown.
large language modelshistoryprosampcons
https://www.packtpub.com/en-ph/learning/author-posts/the-complete-guide-to-nlp-foundations-techniques-and-large-language-models
Explore "Mastering NLP from Foundations to LLMs," a comprehensive guide for beginners and experts in Natural Language Processing. This book covers everything...
complete guidelarge languagenlpfoundationstechniques
https://www.cudocompute.com/blog/what-is-the-cost-of-training-large-language-models
May 12, 2025 - Explore the true cost of training large language models. Learn about the financial, computational, and environmental costs of AI's most advanced models.
large language modelscosttraining
https://openreview.net/forum?id=SBoRhRCzM3
Large Language Models (LLMs) have achieved remarkable success in reasoning tasks with the development of prompting methods. However, existing prompting...
thoughtpropagationanalogicalapproachcomplex
https://projectzero.google/2024/10/from-naptime-to-big-sleep.html?utm_source=www.hungryminds.dev&utm_medium=referral&utm_campaign=how-robinhood-wins-at-trading-graph-algorithms-101
Posted by the Big Sleep team Introduction In our previous post, Project Naptime: Evaluating Offensive Security Capabilities of Large Language Models, we int...
large language modelsbig sleepnaptimeusing
https://www.nextplatform.com/2023/04/10/the-crazy-eights-of-large-language-models/
Sep 25, 2023 - We read a fairly large number of technical papers here at The Next Platform, and it is a rare thing indeed when we can recommend that everyone – or damned
large language modelscrazy eights
https://openreview.net/forum?id=0myKAuHN3M&referrer=%5Bthe%20profile%20of%20Vinija%20Jain%5D(%2Fprofile%3Fid%3D~Vinija_Jain1)
Assessing the effectiveness of large language models (LLMs) in performing different tasks is crucial for understanding their strengths and weaknesses. This...
evaluation frameworklarge languagehierarchicalpromptingtaxonomy
https://www.frontiersin.org/journals/artificial-intelligence/articles/10.3389/frai.2025.1533508/full
BackgroundClinical data is instrumental to medical research, machine learning (ML) model development, and advancing surgical care, but access is often constr...
large language modelsfrontiersgeneratingsyntheticclinical
https://openreview.net/forum?id=rQ7fz9NO7f&referrer=%5Bthe%20profile%20of%20Gang%20Liu%5D(%2Fprofile%3Fid%3D~Gang_Liu6)
While large language models (LLMs) have integrated images, adapting them to graphs remains challenging, limiting their applications in materials and drug...
large language modelsmultimodalinversemoleculardesign
https://www.technologyreview.com/2022/11/18/1063487/meta-large-language-model-ai-only-survived-three-days-gpt-3-science/
Nov 22, 2022 - Galactica was supposed to help scientists. Instead, it mindlessly spat out biased and incorrect nonsense.
large language modelthree dayslatest
https://www.analyticsvidhya.com/blog/category/large-language-models/
Large language models like ChatGPT are transforming communication. Learn how they can enhance your writing, translation, and more. [Read now]
large language modelsanalytics vidhyaarchives
https://aclanthology.org/2024.emnlp-main.805/
Yougang Lyu, Lingyong Yan, Shuaiqiang Wang, Haibo Shi, Dawei Yin, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren. Proceedings of the 2024 Conference...
large language modelsfine tuningknowledgeawareacl