ollama

Author	SHA1	Message	Date
Daniel Hiltgen	830fdd2715	Better explain multi-gpu behavior	2024-07-23 15:16:38 -07:00
royjhan	c0648233f2	api embed docs (#5282 )	2024-07-22 13:37:08 -07:00
Daniel Hiltgen	283948c83b	Adjust windows ROCm discovery The v5 hip library returns unsupported GPUs which wont enumerate at inference time in the runner so this makes sure we align discovery. The gfx906 cards are no longer supported so we shouldn't compile with that GPU type as it wont enumerate at runtime.	2024-07-20 15:17:50 -07:00
royjhan	0d41623b52	OpenAI: Add Suffix to `v1/completions` (#5611 ) * add suffix * remove todo * remove TODO * add to test * rm outdated prompt tokens info md * fix test * fix test	2024-07-16 20:50:14 -07:00
Daniel Hiltgen	1f50356e8e	Bump ROCm on windows to 6.1.2 This also adjusts our algorithm to favor our bundled ROCm. I've confirmed VRAM reporting still doesn't work properly so we can't yet enable concurrency by default.	2024-07-10 11:01:22 -07:00
Jeffrey Morgan	8f8e736b13	update llama.cpp submodule to `d7fd29f` (#5475 )	2024-07-05 13:25:58 -04:00
Daniel Hiltgen	52abc8acb7	Document older win10 terminal problems We haven't found a workaround, so for now recommend updating.	2024-07-03 17:32:14 -07:00
Daniel Hiltgen	ef757da2c9	Better nvidia GPU discovery logging Refine the way we log GPU discovery to improve the non-debug output, and report more actionable log messages when possible to help users troubleshoot on their own.	2024-07-03 10:50:40 -07:00
Daniel Hiltgen	d2f19024d0	Merge pull request #5442 from dhiltgen/concurrency_docs Add windows radeon concurrency note	2024-07-02 12:47:47 -07:00
Daniel Hiltgen	69c04eecc4	Add windows radeon concurreny note	2024-07-02 12:46:14 -07:00
royjhan	996bb1b85e	OpenAI: /v1/models and /v1/models/{model} compatibility (#5007 ) * OpenAI v1 models * Refactor Writers * Add Test Co-Authored-By: Attila Kerekes * Credit Co-Author Co-Authored-By: Attila Kerekes <439392+keriati@users.noreply.github.com> * Empty List Testing * Use Namespace for Ownedby * Update Test * Add back envconfig * v1/models docs * Use ModelName Parser * Test Names * Remove Docs * Clean Up * Test name Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> * Add Middleware for Chat and List * Testing Cleanup * Test with Fatal * Add functionality to chat test * OpenAI: /v1/models/{model} compatibility (#5028) * Retrieve Model * OpenAI Delete Model * Retrieve Middleware * Remove Delete from Branch * Update Test * Middleware Test File * Function name * Cleanup * Test Update * Test Update --------- Co-authored-by: Attila Kerekes <439392+keriati@users.noreply.github.com> Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-07-02 11:50:56 -07:00
Daniel Hiltgen	dfded7e075	Merge pull request #5364 from dhiltgen/concurrency_docs Document concurrent behavior and settings	2024-07-01 09:49:48 -07:00
Eduard	27402cb7a2	Update gpu.md (#5382 ) Runs fine on a NVIDIA GeForce GTX 1050 Ti	2024-06-30 21:48:51 -04:00
Jeffrey Morgan	c1218199cf	Update api.md	2024-06-29 16:22:49 -07:00
Daniel Hiltgen	aae56abb7c	Document concurrent behavior and settings	2024-06-28 13:15:57 -07:00
royjhan	6d4219083c	Update docs (#5312 )	2024-06-28 09:58:14 -07:00
royjhan	fedf71635e	Extend api/show and ollama show to return more model info (#4881 ) * API Show Extended * Initial Draft of Information Co-Authored-By: Patrick Devine <pdevine@sonic.net> * Clean Up * Descriptive arg error messages and other fixes * Second Draft of Show with Projectors Included * Remove Chat Template * Touches * Prevent wrapping from files * Verbose functionality * Docs * Address Feedback * Lint * Resolve Conflicts * Function Name * Tests for api/show model info * Show Test File * Add Projector Test * Clean routes * Projector Check * Move Show Test * Touches * Doc update --------- Co-authored-by: Patrick Devine <pdevine@sonic.net>	2024-06-19 14:19:02 -07:00
Daniel Hiltgen	9d8a4988e8	Implement log rotation for tray app	2024-06-19 12:53:34 -07:00
Jeffrey Morgan	176d0f7075	Update import.md	2024-06-17 19:44:14 -04:00
Jeffrey Morgan	c7b77004e3	docs: add missing powershell package to windows development instructions (#5075 ) * docs: add missing instruction for powershell build The powershell script for building Ollama on Windows now requires the `ThreadJob` module. Add this to the instructions and dependency list. * Update development.md	2024-06-15 23:08:09 -04:00
Jeffrey Morgan	6b800aa7b7	openai: do not set temperature to 0 when setting seed (#5045 )	2024-06-14 13:43:56 -07:00
Patrick Devine	4dc7fb9525	update 40xx gpu compat matrix (#5036 )	2024-06-13 17:10:33 -07:00
Jeffrey Morgan	ead259d877	llm: fix seed value not being applied to requests (#4986 )	2024-06-11 14:24:41 -07:00
Michael Yang	5bc029c529	Merge pull request #4921 from ollama/mxyng/import-md update import.md	2024-06-10 11:41:09 -07:00
Napuh	896495de7b	Add instructions to easily install specific versions on faq.md (#4084 ) * Added instructions to easily install specific versions on faq.md * Small typo * Moved instructions on how to install specific version to linux.md * Update docs/linux.md * Update docs/linux.md --------- Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-06-09 10:49:03 -07:00
Jeffrey Morgan	943172cbf4	Update api.md	2024-06-08 23:04:32 -07:00
Michael Yang	b9ce7bf75e	update import.md	2024-06-07 16:45:15 -07:00
royjhan	28c7813ac4	API PS Documentation (#4822 ) * API PS Documentation	2024-06-05 11:06:53 -07:00
Shubham	60323e0805	add embed model command and fix question invoke (#4766 ) * add embed model command and fix question invoke * Update docs/tutorials/langchainpy.md Co-authored-by: Kim Hallberg <hallberg.kim@gmail.com> * Update docs/tutorials/langchainpy.md --------- Co-authored-by: Kim Hallberg <hallberg.kim@gmail.com> Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-06-03 22:20:48 -07:00
Daniel Hiltgen	0fc0cfc6d2	Merge pull request #4594 from dhiltgen/doc_container_workarounds Add isolated gpu test to troubleshooting	2024-05-30 13:10:54 -07:00
Daniel Hiltgen	1b2d156094	Tidy up developer guide a little	2024-05-23 15:14:05 -07:00
Daniel Hiltgen	f77713bf1f	Add isolated gpu test to troubleshooting	2024-05-23 09:33:25 -07:00
Patrick Devine	3bade04e10	doc updates for the faq/troubleshooting (#4565 )	2024-05-21 15:30:09 -07:00
alwqx	8800c8a59b	chore: fix typo in docs (#4536 )	2024-05-20 14:19:03 -07:00
Patrick Devine	f1548ef62d	update the FAQ to be more clear about windows env variables (#4415 )	2024-05-13 18:01:13 -07:00
睡觉型学渣	9c76b30d72	Correct typos. (#4387 ) * Correct typos. * Correct typos.	2024-05-12 18:21:11 -07:00
Daniel Hiltgen	8cc0ee2efe	Doc container usage and workaround for nvidia errors	2024-05-09 09:26:45 -07:00
Jeffrey Morgan	d5eec16d23	use model defaults for `num_gqa`, `rope_frequency_base` and `rope_frequency_scale` (#1983 )	2024-05-09 09:06:13 -07:00
Carlos Gamez	daa1a032f7	Update langchainjs.md (#2027 ) Updated sample code as per warning notification from the package maintainers	2024-05-08 20:21:03 -07:00
boessu	5d3f7fff26	Update langchainpy.md (#4236 ) fixing pip code.	2024-05-07 16:36:34 -07:00
CrispStrobe	7c5330413b	note on naming restrictions (#2625 ) * note on naming restrictions else push would fail with cryptic retrieving manifest Error: file does not exist ==> maybe change that in code too * Update docs/import.md --------- Co-authored-by: C-4-5-3 <154636388+C-4-5-3@users.noreply.github.com> Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-05-06 16:03:21 -07:00
Jeffrey Chen	d091fe3c21	Windows automatically recognizes username (#3214 )	2024-05-06 15:03:14 -07:00
Mohamed A. Fouad	ee02f548c8	Update linux.md (#3847 ) Add -e to viewing logs in order to show end of ollama logs	2024-05-06 15:02:25 -07:00
Darinka	3ecae420ac	Update api.md (#3945 ) * Update api.md Changed the calculation of tps (token/s) in the documentation * Update docs/api.md --------- Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-05-06 14:39:58 -07:00
Adrien Brault	aa93423fbf	docs: pbcopy on mac (#3129 )	2024-05-06 13:47:00 -07:00
Hyden Liu	fb8ddc564e	chore: delete `HEAD` (#4194 )	2024-05-06 10:32:30 -07:00
Daniel Hiltgen	20f6c06569	Make maximum pending request configurable This also bumps up the default to be 50 queued requests instead of 10.	2024-05-04 21:00:52 -07:00
Daniel Hiltgen	e006480e49	Explain the 2 different windows download options	2024-05-04 12:50:05 -07:00
Dr Nic Williams	e8aaea030e	Update 'llama2' -> 'llama3' in most places (#4116 ) * Update 'llama2' -> 'llama3' in most places --------- Co-authored-by: Patrick Devine <patrick@infrahq.com>	2024-05-03 15:25:04 -04:00
Michael Yang	94c369095f	fix line ending replace CRLF with LF	2024-05-02 14:53:13 -07:00

1 2 3 4 5 ...

318 commits