ollama/server
Jeffrey Morgan 15c2d8fe14
server: parallelize embeddings in API web handler instead of in subprocess runner (#6220)
For simplicity, perform parallelization of embedding requests in the API handler instead of offloading this to the subprocess runner. This keeps the scheduling story simpler as it builds on existing parallel requests, similar to existing text completion functionality.
2024-08-11 11:57:10 -07:00
testdata/tools server: collect nested tool call objects when parsing (#5824) 2024-07-22 12:38:03 -04:00
auth.go fix nil deref in auth.go 2024-07-26 14:14:48 -07:00
download.go Merge pull request #6258 from coolljt0725/fix_typo 2024-08-09 17:19:48 -07:00
fixblobs.go server: replace blob prefix separator from ':' to '-' (#3146) 2024-03-14 20:18:06 -07:00
fixblobs_test.go server: replace blob prefix separator from ':' to '-' (#3146) 2024-03-14 20:18:06 -07:00
images.go manifest: Store layers inside manifests consistently as values. 2024-08-07 17:03:06 -07:00
layer.go manifest: Store layers inside manifests consistently as values. 2024-08-07 17:03:06 -07:00
manifest.go manifest: Store layers inside manifests consistently as values. 2024-08-07 17:03:06 -07:00
manifest_test.go lint 2024-08-01 17:06:06 -07:00
model.go manifest: Store layers inside manifests consistently as values. 2024-08-07 17:03:06 -07:00
model_test.go lint 2024-08-01 17:06:06 -07:00
modelpath.go models 2024-07-22 11:26:12 -07:00
modelpath_test.go uint64 2024-07-22 11:49:02 -07:00
prompt.go tools 2024-07-15 15:26:16 -07:00
prompt_test.go lint 2024-08-01 17:06:06 -07:00
routes.go server: parallelize embeddings in API web handler instead of in subprocess runner (#6220) 2024-08-11 11:57:10 -07:00
routes_create_test.go use testing tempdirs 2024-08-02 16:04:06 -07:00
routes_delete_test.go manifest: Store layers inside manifests consistently as values. 2024-08-07 17:03:06 -07:00
routes_generate_test.go comments 2024-07-31 15:58:55 -07:00
routes_list_test.go lint 2024-08-01 17:06:06 -07:00
routes_test.go lint 2024-08-01 17:06:06 -07:00
sched.go lint 2024-08-01 17:06:06 -07:00
sched_test.go server: parallelize embeddings in API web handler instead of in subprocess runner (#6220) 2024-08-11 11:57:10 -07:00
sparse_common.go Don't hard fail on sparse setup error 2024-08-09 12:16:19 -07:00
sparse_windows.go Don't hard fail on sparse setup error 2024-08-09 12:16:19 -07:00
upload.go manifest: Store layers inside manifests consistently as values. 2024-08-07 17:03:06 -07:00