- prompt cache causes inference to hang after some time
Make sure we're building an x86 ext_server lib when cross-compiling
This switches darwin to dynamic loading and refactors the code now that the library is no longer statically linked on any platform