Llama.cppCode & DevelopmentHighly optimized LLM inference engine in pure C++inferencec++quantizationOpen Source4.6Visit