Featured
12 tools found for "llama"
Highly optimized LLM inference engine in pure C++
Run open-source LLMs locally on your own machine.
AI inference on wafer-scale chips — 1000+ tokens/second
Fast, affordable inference for open-source AI models
Open-source AI code assistant extension for VS Code and JetBrains with any model support.
Multi-model AI chat platform giving access to GPT-4, Claude, Gemini, and hundreds of AI bots in one place.
Open-source AI chat interface by Hugging Face giving access to leading open-weight language models.
Meta's free AI assistant integrated across apps