頻率 | Frequency Feed — AI 模型、產品、工具與開源動態

Apple Silicon 優化的 LLM 推理伺服器

jundot/omlx

專為 Apple Silicon 設計的 LLM 推理伺服器，支援連續批次處理和 SSD 緩存，可透過 macOS 選單列輕鬆管理。

⭐ 16,600Python06/14

apple-siliconinference-serverllmmacos

RunanywhereAI/runanywhere-sdks

一套生產就緒的 SDK，讓開發者能在 iOS、Android、Web 等平台上本地運行 LLM、語音和圖像生成 AI。

⭐ 10,334C++06/14

androidapple-intelligencecppdiffusion-models

alibaba/MNN

MNN 為阿里巴巴推出的輕量級推理引擎，專為邊緣裝置提供高效能的 LLM 推理與 AI 應用支援。

⭐ 15,480C++06/12

armconvolutiondeep-learningembedded-devices