Foundry Local enables developers to run AI models directly on their own hardware for offline scenarios, edge computing, and low-latency applications. The v1.1 and v1.2 releases (early 2026) significantly expanded platform and model support.
| Feature | Details |
|---|---|
| Linux ARM64 Support | Run Foundry Local on ARM64-based Linux devices (Raspberry Pi, NVIDIA Jetson, etc.) |
| Live Audio Transcription | Real-time speech-to-text processing directly on-device |
| Text Embeddings | Generate vector embeddings locally for offline RAG pipelines |
| Qwen 3.5 Vision Support | Run Qwen 3.5 Vision model locally for on-device multimodal inference |
| ONNX Runtime 1.26 | Latest ONNX Runtime for optimized model execution across hardware |
Foundry Local provides SDKs for Python, JavaScript, C#, and Rust, making it accessible across a wide range of development ecosystems.