generate
Do not build AVX runners on ARM64
2024-04-26 23:55:32 -06:00
llama.cpp @ 952d03dbea
update llama.cpp commit to 952d03d
2024-04-30 17:31:20 -04:00
ggla.go
refactor tensor query
2024-04-10 11:37:20 -07:00
ggml.go
fix: mixtral graph
2024-04-22 17:19:44 -07:00
gguf.go
fixes for gguf (#3863)
2024-04-23 20:57:20 -07:00
llm.go
Add import declaration for windows,arm64 to llm.go
2024-04-26 23:23:53 -06:00
llm_linux.go
Switch back to subprocessing for llama.cpp
2024-04-01 16:48:18 -07:00
memory.go
Centralize server config handling
2024-05-05 16:49:50 -07:00
server.go
Use our libraries first
2024-05-06 14:23:29 -07:00
status.go
Switch back to subprocessing for llama.cpp
2024-04-01 16:48:18 -07:00