ollama

Author	SHA1	Message	Date
Daniel Hiltgen	540f4af45f	Wire up more complete CI for releases Flesh out our github actions CI so we can build official releaes.	2024-03-15 12:37:36 -07:00
Blake Mizerany	6ce37e4d96	llm,readline: use errors.Is instead of simple == check (#3161 ) This fixes some brittle, simple equality checks to use errors.Is. Since go1.13, errors.Is is the idiomatic way to check for errors. Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-03-15 07:14:12 -07:00
Blake Mizerany	703684a82a	server: replace blob prefix separator from ':' to '-' (#3146 ) This fixes issues with blob file names that contain ':' characters to be rejected by file systems that do not support them.	2024-03-14 20:18:06 -07:00
Daniel Hiltgen	6459377ae0	Add ROCm support to linux install script (#2966 )	2024-03-14 18:00:16 -07:00
Blake Mizerany	8546dd3d72	.github: fix model and feature request yml (#3155 )	2024-03-14 15:26:06 -07:00
Blake Mizerany	87100be5e0	.github: add issue templates (#3143 )	2024-03-14 15:19:10 -07:00
Michael Yang	e87c780ff9	Merge pull request #3149 from ollama/mxyng/fix-memory-leak fix: clip memory leak	2024-03-14 13:34:15 -07:00
Michael Yang	291c663865	fix: clip memory leak	2024-03-14 13:12:42 -07:00
Daniel Hiltgen	da20786e3e	Merge pull request #3068 from dhiltgen/win_pipe Use stdin for term discovery on windows	2024-03-14 11:55:19 -07:00
Jeffrey Morgan	5ce997a7b9	Update README.md	2024-03-13 21:12:17 -07:00
Jeffrey Morgan	672ffe9b7d	add `OLLAMA_KEEP_ALIVE` to environment variable docs for `ollama serve` (#3127 )	2024-03-13 14:35:33 -07:00
Patrick Devine	47cfe58af5	Default Keep Alive environment variable (#3094 ) --------- Co-authored-by: Chris-AS1 <8493773+Chris-AS1@users.noreply.github.com>	2024-03-13 13:29:40 -07:00
Daniel Hiltgen	c1a81c6fe3	Use stdin for term discovery on windows When you feed input to the cmd via a pipe it no longer reports a warning	2024-03-13 10:37:31 -07:00
Jeffrey Morgan	e72c567cfd	restore locale patch (#3091 )	2024-03-12 22:08:13 -07:00
Bruce MacDonald	3e22611200	token repeat limit for prediction requests (#3080 )	2024-03-12 22:08:25 -04:00
Daniel Hiltgen	a54d4a28dc	Merge pull request #3088 from dhiltgen/rocm_igpu_linux Fix iGPU detection for linux	2024-03-12 17:20:27 -07:00
Daniel Hiltgen	82b0c7c27e	Fix iGPU detection for linux This fixes a few bugs in the new sysfs discovery logic. iGPUs are now correctly identified by their <1G VRAM reported. the sysfs IDs are off by one compared to what HIP wants due to the CPU being reported in amdgpu, but HIP only cares about GPUs.	2024-03-12 16:57:19 -07:00
Patrick Devine	ba7cf7fb66	add more docs on for the modelfile message command (#3087 )	2024-03-12 16:41:41 -07:00
Bruce MacDonald	2f804068bd	warn when json format is expected but not mentioned in prompt (#3081 )	2024-03-12 19:07:11 -04:00
Daniel Hiltgen	34d00f90b1	Merge pull request #3070 from dhiltgen/visible_devices Add docs explaining GPU selection env vars	2024-03-12 11:36:46 -07:00
Daniel Hiltgen	b53229a2ed	Add docs explaining GPU selection env vars	2024-03-12 11:33:06 -07:00
racerole	53c107e20e	chore: fix typo (#3073 ) Signed-off-by: racerole <jiangyifeng@outlook.com>	2024-03-12 14:09:22 -04:00
mofanke	51578d8573	fix gpu_info_cuda.c compile warning (#3077 )	2024-03-12 14:08:40 -04:00
Jeffrey Morgan	b5fcd9d3aa	use `-trimpath` when building releases (#3069 )	2024-03-11 15:58:46 -07:00
Bruce MacDonald	b80661e8c7	relay load model errors to the client (#3065 )	2024-03-11 16:48:27 -04:00
Jeffrey Morgan	6d3adfbea2	Update troubleshooting.md	2024-03-11 13:22:28 -07:00
Jeffrey Morgan	369eda65f5	update llama.cpp submodule to `ceca1ae` (#3064 )	2024-03-11 12:57:48 -07:00
Michael Yang	f878e91070	Merge pull request #3044 from ollama/mxyng/fix-convert-shape convert: fix shape	2024-03-11 09:56:57 -07:00
Daniel Hiltgen	0d651478e4	Merge pull request #3056 from dhiltgen/rocm_link_clash Avoid rocm runner and dependency clash	2024-03-11 09:48:48 -07:00
Michael Yang	9ea492f1ce	convert: fix shape	2024-03-11 09:41:01 -07:00
Daniel Hiltgen	bc13da2bfe	Avoid rocm runner and dependency clash Putting the rocm symlink next to the runners is risky. This moves the payloads into a subdir to avoid potential clashes.	2024-03-11 09:33:22 -07:00
Jeffrey Morgan	41b00b9856	fix `03-locale.diff`	2024-03-10 16:21:05 -07:00
Daniel Hiltgen	c2a8ed48e7	Merge pull request #3048 from dhiltgen/harden_rocm_deps Harden for deps file being empty (or short)	2024-03-10 15:17:22 -07:00
Daniel Hiltgen	3dc1bb6a35	Harden for deps file being empty (or short)	2024-03-10 14:45:38 -07:00
Daniel Hiltgen	7865a6996a	Merge pull request #3046 from dhiltgen/rocm_search_paths Add ollama executable peer dir for rocm	2024-03-10 12:30:56 -07:00
Daniel Hiltgen	00ec269321	Add ollama executable peer dir for rocm This allows people who package up ollama on their own to place the rocm dependencies in a peer directory to the ollama executable much like our windows install flow.	2024-03-10 12:16:30 -07:00
Jeffrey Morgan	908005d90b	patch: use default locale in wpm tokenizer (#3034 )	2024-03-09 21:12:12 -08:00
Jeffrey Morgan	cdf65e793f	only copy deps for `amd64` in `build_linux.sh`	2024-03-09 17:55:22 -08:00
Daniel Hiltgen	82ca694d68	Rename ROCm deps file to avoid confusion (#3025 )	2024-03-09 17:48:38 -08:00
Jeffrey Morgan	5017a15bcb	add `macapp` to `.dockerignore`	2024-03-09 16:07:06 -08:00
Jeffrey Morgan	e11668aa07	add `bundle_metal` and `cleanup_metal` funtions to `gen_darwin.sh`	2024-03-09 16:04:57 -08:00
Jeffrey Morgan	0bd0f4a29c	tidy cleanup logs	2024-03-09 15:56:48 -08:00
Jeffrey Morgan	1ffb1e2874	update llama.cpp submodule to `77d1ac7` (#3030 )	2024-03-09 15:55:34 -08:00
Daniel Hiltgen	0a7844413c	Merge pull request #3026 from dhiltgen/win_rocm_docs Doc how to set up ROCm builds on windows	2024-03-09 14:17:19 -08:00
Jeffrey Morgan	f9cd55c70b	disable gpu for certain model architectures and fix divide-by-zero on memory estimation	2024-03-09 12:51:38 -08:00
Daniel Hiltgen	0fdebb34a9	Doc how to set up ROCm builds on windows	2024-03-09 11:29:45 -08:00
Daniel Hiltgen	ac64cd4ef9	Merge pull request #3008 from dhiltgen/no_more_idempotent Finish unwinding idempotent payload logic	2024-03-09 09:13:24 -08:00
Daniel Hiltgen	4a5c9b8035	Finish unwinding idempotent payload logic The recent ROCm change partially removed idempotent payloads, but the ggml-metal.metal file for mac was still idempotent. This finishes switching to always extract the payloads, and now that idempotentcy is gone, the version directory is no longer useful.	2024-03-09 08:34:39 -08:00
Jeffrey Morgan	efe5617b64	update llama.cpp submodule to `c2101a2` (#3020 )	2024-03-09 00:44:50 -08:00
Jeffrey Morgan	5b3fad9636	separate out `isLocalIP`	2024-03-09 00:22:08 -08:00

1 2 3 4 5 ...

2186 commits