Sponsor of the Day:
Jerkmate
https://neurips.cc/virtual/2024/98189
NeurIPS Empirical Upper Bounds for Unstructured Sparsity in Compute-Efficient Language Modeling
compute efficientlanguage modelingneuripsempiricalupper
https://lelapa.ai/about
About Lelapa AI — Resource-Efficient Language AI Built to Scale
Lelapa AI is a research and product lab building resource-efficient language systems designed for real-world deployment and global scalability.
lelapa airesource efficientlanguage builtscale
https://blog.openresty.com/en/edgelang-intro/
EdgeLang: A Powerful and Efficient Language for Gateway Logic - OpenResty Official Blog
openresty official blogefficient languageedgelangpowerfulgateway
https://j-min.io/publication/perceivervl_wacv2023/
Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention | Jaemin Cho
Sep 23, 2023 - Efficient VL modeling with Perceiver-based iterative cross-attentions - *[WACV 2023](https://nips.cc/Conferences/2021)*
efficient visionlanguage modelingjaemin choperceivervl
https://arxiv.org/abs/2509.22186
[2509.22186] MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document...
Abstract page for arXiv paper 2509.22186: MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
vision language modelefficient high2509decoupledresolution
https://www.liquid.ai/blog/lfm2-vl-efficient-vision-language-models
LFM2-VL: Efficient Vision-Language Models | Liquid AI
Oct 21, 2025 - Today, we release LFM2-VL, our first series of vision-language foundation models. These multimodal models are designed for low-latency and device-aware...
vision language modelsliquid ailfm2vlefficient
https://sakana.ai/taid/
TAID: A Novel Method for Efficient Knowledge Transfer from Large Language Models to Small Language...
TAID: A Novel Method for Efficient Knowledge Transfer from Large Language Models to Small Language Models
large language modelsnovel methodknowledge transfertaidefficient
https://arxiv.org/abs/2309.06180
[2309.06180] Efficient Memory Management for Large Language Model Serving with PagedAttention
Abstract page for arXiv paper 2309.06180: Efficient Memory Management for Large Language Model Serving with PagedAttention
large language modelefficient memory2309managementserving
https://huggingface.co/papers/2309.06180
Paper page - Efficient Memory Management for Large Language Model Serving with PagedAttention
Join the discussion on this paper page
large language modelefficient memorypapermanagementserving
https://arxiv.org/abs/2309.00071
[2309.00071] YaRN: Efficient Context Window Extension of Large Language Models
Abstract page for arXiv paper 2309.00071: YaRN: Efficient Context Window Extension of Large Language Models
large language modelsefficient context2309yarnwindow
https://techcrunch.com/2023/07/11/fileread/
Backed by Gradient, Fileread uses large language models to make legal discovery more efficient |...
Jul 12, 2023 - Legal discovery is one of the most time-consuming parts of litigation and typically involves team of specialists combing through towers of documents. Fileread,...
large language modelsbackedgradientusesmake
https://www.semanticscholar.org/reader/57e849d0de13ed5f91d086936296721d4ff75a75
[PDF] LLaMA: Open and Efficient Foundation Language Models | Semantic Scholar
An academic search engine that utilizes artificial intelligence methods to provide highly relevant results and novel tools to filter them with ease.
models semantic scholarpdfllamaopenefficient
https://lelapa.ai/blog
Blog — Insights on Language AI & Resource-Efficient Systems | Lelapa AI
Insights on language AI, resource-efficient systems, and building technology for the Global South from the Lelapa AI team.
language airesource efficientbloginsightssystems
https://huggingface.co/papers/2309.00071
Paper page - YaRN: Efficient Context Window Extension of Large Language Models
Join the discussion on this paper page
large language modelsefficient contextpaperyarnwindow
https://huggingface.co/papers/2403.13372
Paper page - LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
Join the discussion on this paper page
efficient fine tuninglanguage modelspaperllamafactoryunified
https://www.huaweicloud.com/intl/en-us/product/cloudbuild.html
CodeArts Build – Multi-language – Efficient-HUAWEI CLOUD
An easy-to-configure platform that supports multi-language parallel builds on the cloud. Its distributed acceleration helps enterprises improve build...
multi languagehuawei cloudcodeartsbuildefficient
https://arxiv.org/abs/2403.13372
[2403.13372] LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
Abstract page for arXiv paper 2403.13372: LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
efficient fine tuninglanguage models240313372llamafactory
https://huggingface.co/papers/2509.22186
Paper page - MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document...
Join the discussion on this paper page
vision language modelefficient highpaper5decoupled
https://www.amazon.science/publications/exploring-fine-tuning-for-in-context-retrieval-and-efficient-kv-caching-in-long-context-language-models
Exploring fine-tuning for in-context retrieval and efficient KV-caching in long-context language...
With context windows of millions of tokens, Long-Context Language Models (LCLMs) can encode entire document collections, offering a strong alternative to...
fine tuningexploringcontextretrievalefficient