Robuta

Sponsor of the Day: Jerkmate
https://neurips.cc/virtual/2024/98189 NeurIPS Empirical Upper Bounds for Unstructured Sparsity in Compute-Efficient Language Modeling compute efficientlanguage modelingneuripsempiricalupper https://lelapa.ai/about About Lelapa AI — Resource-Efficient Language AI Built to Scale Lelapa AI is a research and product lab building resource-efficient language systems designed for real-world deployment and global scalability. lelapa airesource efficientlanguage builtscale https://blog.openresty.com/en/edgelang-intro/ EdgeLang: A Powerful and Efficient Language for Gateway Logic - OpenResty Official Blog openresty official blogefficient languageedgelangpowerfulgateway https://j-min.io/publication/perceivervl_wacv2023/ Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention | Jaemin Cho Sep 23, 2023 - Efficient VL modeling with Perceiver-based iterative cross-attentions - *[WACV 2023](https://nips.cc/Conferences/2021)* efficient visionlanguage modelingjaemin choperceivervl https://arxiv.org/abs/2509.22186 [2509.22186] MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document... Abstract page for arXiv paper 2509.22186: MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing vision language modelefficient high2509decoupledresolution https://www.liquid.ai/blog/lfm2-vl-efficient-vision-language-models LFM2-VL: Efficient Vision-Language Models | Liquid AI Oct 21, 2025 - Today, we release LFM2-VL, our first series of vision-language foundation models. These multimodal models are designed for low-latency and device-aware... vision language modelsliquid ailfm2vlefficient https://sakana.ai/taid/ TAID: A Novel Method for Efficient Knowledge Transfer from Large Language Models to Small Language... TAID: A Novel Method for Efficient Knowledge Transfer from Large Language Models to Small Language Models large language modelsnovel methodknowledge transfertaidefficient https://arxiv.org/abs/2309.06180 [2309.06180] Efficient Memory Management for Large Language Model Serving with PagedAttention Abstract page for arXiv paper 2309.06180: Efficient Memory Management for Large Language Model Serving with PagedAttention large language modelefficient memory2309managementserving https://huggingface.co/papers/2309.06180 Paper page - Efficient Memory Management for Large Language Model Serving with PagedAttention Join the discussion on this paper page large language modelefficient memorypapermanagementserving https://arxiv.org/abs/2309.00071 [2309.00071] YaRN: Efficient Context Window Extension of Large Language Models Abstract page for arXiv paper 2309.00071: YaRN: Efficient Context Window Extension of Large Language Models large language modelsefficient context2309yarnwindow https://techcrunch.com/2023/07/11/fileread/ Backed by Gradient, Fileread uses large language models to make legal discovery more efficient |... Jul 12, 2023 - Legal discovery is one of the most time-consuming parts of litigation and typically involves team of specialists combing through towers of documents. Fileread,... large language modelsbackedgradientusesmake https://www.semanticscholar.org/reader/57e849d0de13ed5f91d086936296721d4ff75a75 [PDF] LLaMA: Open and Efficient Foundation Language Models | Semantic Scholar An academic search engine that utilizes artificial intelligence methods to provide highly relevant results and novel tools to filter them with ease. models semantic scholarpdfllamaopenefficient https://lelapa.ai/blog Blog — Insights on Language AI & Resource-Efficient Systems | Lelapa AI Insights on language AI, resource-efficient systems, and building technology for the Global South from the Lelapa AI team. language airesource efficientbloginsightssystems https://huggingface.co/papers/2309.00071 Paper page - YaRN: Efficient Context Window Extension of Large Language Models Join the discussion on this paper page large language modelsefficient contextpaperyarnwindow https://huggingface.co/papers/2403.13372 Paper page - LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Join the discussion on this paper page efficient fine tuninglanguage modelspaperllamafactoryunified https://www.huaweicloud.com/intl/en-us/product/cloudbuild.html CodeArts Build – Multi-language – Efficient-HUAWEI CLOUD An easy-to-configure platform that supports multi-language parallel builds on the cloud. Its distributed acceleration helps enterprises improve build... multi languagehuawei cloudcodeartsbuildefficient https://arxiv.org/abs/2403.13372 [2403.13372] LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Abstract page for arXiv paper 2403.13372: LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models efficient fine tuninglanguage models240313372llamafactory https://huggingface.co/papers/2509.22186 Paper page - MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document... Join the discussion on this paper page vision language modelefficient highpaper5decoupled https://www.amazon.science/publications/exploring-fine-tuning-for-in-context-retrieval-and-efficient-kv-caching-in-long-context-language-models Exploring fine-tuning for in-context retrieval and efficient KV-caching in long-context language... With context windows of millions of tokens, Long-Context Language Models (LCLMs) can encode entire document collections, offering a strong alternative to... fine tuningexploringcontextretrievalefficient