https://www.minfytech.com/case-studies/darwinbox
Empowering Darwinbox's AI Model Inference with Scalability and Efficiency
s aimodel inferenceempoweringdarwinboxscalability
https://tldr.takara.ai/p/2510.18672
Reasoning Language Model Inference Serving Unveiled: An Empirical Study | Takara TLDR
The reasoning large language model (RLLM) has been proven competitive in solving complex reasoning tasks such as mathematics, coding, compared to general LLM...
language modelreasoninginferenceserving
https://www.together.ai/dedicated-model-inference
Dedicated Model Inference | Together AI
Deploy models on dedicated inference endpoints engineered for speed, control, and best-in-class unit economics — backed by Together's frontier AI research.
model inferencededicatedtogetherai
https://www.microsoft.com/en-us/research/blog/deepspeed-accelerating-large-scale-model-inference-and-training-via-system-optimizations-and-compression/
DeepSpeed: Accelerating large-scale model inference and training via system optimizations and...
Nov 1, 2022 - Last month, the DeepSpeed Team announced ZeRO-Infinity, a step forward in training models with tens of trillions of parameters. In addition to creating...
large scalemodel inferencedeepspeedaccelerating
https://docs.instill-ai.com/reference/modelpublicservice_triggerasyncnamespacemodel
Trigger model inference asynchronously
Triggers a deployed model to infer the result of a set of task or questions.
model inferencetrigger
https://aws.amazon.com/solutions/guidance/low-latency-high-throughput-model-inference-using-amazon-sagemaker/
Guidance for Low-Latency, High Throughput Model Inference Using Amazon SageMaker
This Guidance shows how to use Amazon SageMaker to support high-throughput model inferencing workloads like programmatic advertising and real-time bidding...
low latencyhigh throughputmodel inferenceguidance
https://papers.neurips.cc/paper_files/paper/2025/hash/088463cd3126aef2002ffc69da42ec59-Abstract-Conference.html
Efficient Large Language Model Inference with Neural Block Linearization
large language modelefficientinferenceneuralblock
https://careers.redpoint.com/companies/abridge/jobs/55562031-senior-staff-machine-learning-systems-engineer
Machine Learning Infrastructure Engineer- Model Inference @ Abridge | Redpoint Ventures Job Board
Search job openings across the Redpoint Ventures network.
machine learninginfrastructure engineermodel inferenceredpoint ventures
https://www.fractile.ai/
Fractile - Radically Accelerate Frontier Model Inference
Fractile is designing AI compute systems that will enable the next generation of AI scaling: frontier model inference, 25x faster, at 1/10th the cost.
fractileradicallyacceleratefrontiermodel
https://cohere.com/solutions/model-vault
Model Vault | Dedicated Model Inference Platform | Cohere
Model Vault is a fully managed inference platform for Cohere models, giving enterprises the advantages of self-hosted AI without the operational overhead.
model vaultdedicated inferenceplatformcohere
https://docs.feast.dev/v0.62-branch/getting-started/architecture/model-inference
Feature Serving and Model Inference | v0.62-branch | Feast: the Open Source Feature Store
https://uva.sowiso.nl/courses/theory/74/428/7163/en
Inference about the Slope of a Linear Model
about theinferenceslopelinearmodel
https://docs.ray.io/en/latest/ray-overview/examples/e2e-timeseries/e2e_timeseries/02-Validation.html
DLinear model validation using offline batch inference — Ray 2.55.1
model validationbatch inference
https://scads.ai/theses/privacy-in-deep-learning-investigating-the-impact-of-model-compression-on-membership-inference-vulnerability/
Privacy in Deep Learning: Investigating the Impact of Model Compression on Membership Inference...
https://soc.washington.edu/research/publications/narratives-numbers-valid-inference-using-language-model-predictions-verbal
From Narratives to Numbers: Valid Inference Using Language Model Predictions from Verbal Autopsies...
In settings where most deaths occur outside the healthcare system, verbal autopsies (VAs) are a common tool to monitor trends in causes of death (COD). VAs are...