
威特
Witsy:桌面AI助手
3 years
Works with Finder
940
Github Watches
68
Github Forks
940
Github Stars
Downloads
Download Witsy from witsyai.com or from the releases page.
What is Witsy?
Witsy is a BYOK (Bring Your Own Keys) AI application: it means you need to have API keys for the LLM providers you want to use. Alternatively, you can use Ollama to run models locally on your machine for free and use them in Witsy.
Non-exhaustive feature list:
- OpenAI, Ollama, Anthropic, MistralAI, Google, xAI, Azure, OpenRouter, DeepSeek, Groq and Cerebras models supported
- Connect other providers (together, siliconflow, fireworks...) through OpenAI compatibility layer
- Chat completion with vision models support (describe an image)
- Text-to-image and text-to video with OpenAI, Google, xAI, Replicate, fal.ai and HuggingFace
- Image-to-image (image editing) and image-to-video with Google, Replicate and fal.ai
- Scratchpad to interactively create the best content with any model!
- Prompt anywhere allows to generate content directly in any application
- AI commands runnable on highlighted text in almost any application
- Experts prompts to specialize your bot on a specific topic
- LLM plugins to augment LLM: execute python code, search the Internet...
- Anthropic MCP server support
- Long-term memory plugin to increase relevance of LLM answers
- Read aloud of assistant messages (requires OpenAI or ElevenLabs API key)
- Read aloud of any text in other applications (requires OpenAI or ElevenLabs API key)
- Chat with your local files and documents (RAG)
- Transcription/Dictation (Speech-to-Text)
- Realtime Chat aka Voice Mode
- Anthropic Computer Use support
- Local history of conversations (with automatic titles)
- Formatting and copy to clipboard of generated code
- Conversation PDF export
- Image copy and download
Prompt Anywhere
Generate content in any application:
- From any editable content in any application
- Hit the Prompt anywhere shortcut (Shift+Control+Space / ^⇧Space)
- Enter your prompt in the window that pops up
- Watch Witsy enter the text directly in your application!
On Mac, you can define an expert that will automatically be triggered depending on the foreground application. For instance, if you have an expert used to generate linux commands, you can have it selected if you trigger Prompt Anywhere from the Terminal application!
AI Commands
AI commands are quick helpers accessible from a shortcut that leverage LLM to boost your productivity:
- Select any text in any application
- Hit the AI command shorcut (Alt+Control+Space / ⌃⌥Space)
- Select one of the commands and let LLM do their magic!
You can also create custom commands with the prompt of your liking!
Commands inspired by https://the.fibery.io/@public/Public_Roadmap/Roadmap_Item/AI-Assistant-via-ChatGPT-API-170.
Experts
From https://github.com/f/awesome-chatgpt-prompts.
Scratchpad
https://www.youtube.com/watch?v=czcSbG2H-wg
Chat with your documents (RAG)
You can connect each chat with a document repository: Witsy will first search for relevant documents in your local files and provide this info to the LLM. To do so:
- Click on the database icon on the left of the prompt
- Click Manage and then create a document repository
- OpenAI Embedding require on API key, Ollama requires an embedding model
- Add documents by clicking the + button on the right hand side of the window
- Once your document repository is created, click on the database icon once more and select the document repository you want to use. The icon should turn blue
Transcription / Dictation (Speech-to-Text)
You can transcribe audio recorded on the microphone to text. Transcription can be done using OpenAI Whisper online model (requires API key) or using local Whisper model (requires download of large files). Once the text is transcribed you can:
- Copy it to your clipboard
- Insert it in the application that was running before you activated the dictation
Anthropic Computer Use
https://www.youtube.com/watch?v=vixl7I07hBk
Setup
You can download a binary from from witsyai.com, from the releases page or build yourself:
npm install
npm start
Prerequisites
To use OpenAI, Anthropic, Google or Mistral AI models, you need to enter your API key:
To use Ollama models, you need to install Ollama and download some models.
To use text-to-speech, you need an OpenAI API key.
To use Internet search you need a Tavily API key.
TODO
- Proper database (SQLite3) storage (??)
WIP
DONE
- Azure AI support
- Brave Search plugin
- Allow user-input models for embeddings
- User defined parameters for custom engines
- Direct speech-to-text checbox
- Quick access buttons on home
- fal.ai support (speech-to-text, text-to-image and text-to-video)
- Debug console
- Design Studio
- i18n
- Mermaid diagram rendering
- Smithery.ai MCP integration
- Model Context Protocol
- Local Web Search
- Model defaults
- Speech-to-text language
- Model parameters (temperature...)
- Favorite models
- ElevenLabs Text-to-Speech
- Custom engines (OpenAI compatible)
- Long-term memory plugin
- OpenRouter support
- DeepSeek support
- Folder mode
- All instructions customization
- Fork chat (with optional LLM switch)
- Realtime chat
- Replicate video generation
- Together.ai compatibility
- Gemini 2.0 Flash support
- Groq LLama 3.3 support
- xAI Grok Vision Model support
- Ollama function-calling
- Replicate image generation
- AI Commands redesign
- Token usage report
- OpenAI o1 models support
- Groq vision support
- Image resize option
- Llama 3.2 vision support
- YouTube plugin
- RAG in Scratchpad
- Hugging face image generation
- Show prompt used for image generation
- Redesigned Prompt window
- Anthropic Computer Use
- Auto-update refactor (still not Windows)
- Dark mode
- Conversation mode
- Google function calling
- Anthropic function calling
- Scratchpad
- Dictation: OpenAI Whisper + Whisper WebGPU
- Auto-select expert based on foremost app (Mac only)
- Cerebras support
- Local files RAG
- Groq model update (8-Sep-2024)
- PDF Export of chats
- Prompts renamed to Experts. Now editable.
- Read aloud
- Import/Export commands
- Anthropic Sonnet 3.5
- Ollama base URL as settings
- OpenAI base URL as settings
- DALL-E as tool
- Google Gemini API
- Prompt anywhere
- Cancel commands
- GPT-4o support
- Different default engine/model for commands
- Text attachments (TXT, PDF, DOCX, PPTX, XLSX)
- MistralAI function calling
- Auto-update
- History date sections
- Multiple selection delete
- Search
- Groq API
- Custom prompts
- Sandbox & contextIsolation
- Application Menu
- Prompt history navigation
- Ollama model pull
- macOS notarization
- Fix when long text is highlighted
- Shortcuts for AI commands
- Shift to switch AI command behavior
- User feedback when running a tool
- Download internet content plugin
- Tavily Internet search plugin
- Python code execution plugin
- LLM Tools supprt (OpenAI only)
- Mistral AI API integration
- Latex rendering
- Anthropic API integration
- Image generation as b64_json
- Text-to-speech
- Log file (electron-log)
- Conversation language settings
- Paste image in prompt
- Run commands with default models
- Models refresh
- Edit commands
- Customized commands
- Conversation menu (info, save...)
- Conversation depth setting
- Save attachment on disk
- Keep running in system tray
- Nicer icon (still temporary)
- Rename conversation
- Copy/edit messages
- New chat window for AI command
- AI Commands with shortcut
- Auto-switch to vision model
- Run at login
- Shortcut editor
- Chat font size settings
- Image attachment for vision
- Stop response streaming
- Save/Restore window position
- Ollama support
- View image full screen
- Status/Tray bar icon + global shortcut to invoke
- Chat themes
- Default instructions in settings
- Save DALL-E images locally (and delete properly)
- OpenAI links in settings
- Copy code button
- Chat list ordering
- OpenAI model choice
- CSS variables
相关推荐
😎简单易用、🧩丰富生态 -大模型原生即时通信机器人平台| 适配QQ / 微信(企业微信、个人微信) /飞书 /钉钉 / discord / telegram / slack等平台| 支持chatgpt,deepseek,dify,claude,基于LLM的即时消息机器人平台,支持Discord,Telegram,微信,Lark,Dingtalk,QQ,Slack
📦ropomix(以前是repopack)是一种功能强大的工具,可将整个存储库包装到一个单一的AI友好文件中。非常适合当您需要将代码库提供给大型语言模型(LLMS)或其他AI工具(例如Claude,Chatgpt,Deepseek,Cllexity,Gemini,Gemini,Gemma,Gemma,Lllama,Grok等)等其他AI工具。
Reviews

user_CA43Kj9A
As a dedicated MCP application user, I recently discovered "Witsy" by nbonamy, and it has been a game-changer! This product is user-friendly and integrates seamlessly into my existing workflow. It's clear that a lot of thought and effort has gone into making "Witsy" a reliable and efficient tool. I highly recommend it to anyone looking for a robust solution to enhance their productivity.

user_SUAX37yb
I've been using the Witsy app created by nbonamy for a while now, and I must say it's fantastic! The interface is user-friendly, and the functionality is top-notch. It's clear that a lot of thought has gone into making this a seamless experience. Highly recommended for anyone looking for an efficient and reliable tool.

user_RGocHoXE
Witsy by nbonamy is an exceptional MCP application that has transformed my workflow. Its intuitive design and robust features have made task management a breeze. I highly recommend Witsy to anyone looking to streamline their processes and enhance productivity. Its seamless integration and user-friendly interface stand out, making it a must-have tool for professionals and enthusiasts alike.

user_kTX7yKOu
As a dedicated MCP application user, I find Witsy by nbonamy to be an exceptional tool. Its intuitive interface and seamless functionality make it perfect for anyone looking to streamline their workflows. The attention to detail and user experience is top-notch. Highly recommended!

user_MjI2HLuT
Witsy by nbonamy is a fantastic tool for MCP app users. Its sleek design and user-friendly interface make it easy to navigate. Highly recommended!

user_60FWxYHQ
I have been using witsy by nbonamy and it has truly exceeded my expectations. The intuitive design and seamless user experience make it a standout application. As a loyal MCP application user, I highly recommend witsy for its innovative features and reliability. This product has significantly improved my productivity and efficiency.

user_3Kk5ZKk0
Witsy is an innovative MCP application created by nbonamy. It offers a user-friendly interface and robust functionality, making it a standout tool for managing your content seamlessly. The easy-to-navigate URL structure and welcoming information make the onboarding process smooth and delightful. Highly recommended for anyone needing a reliable content management solution!

user_lEk0gmrd
Witsy by nbonamy is a fantastic tool that has significantly enhanced my workflow. Its intuitive design and robust features make it a must-have for anyone looking to boost their productivity. Whether you're a novice or an experienced user, Witsy is easy to use and highly efficient. I highly recommend giving it a try!

user_7EvjI7IY
I've been using Witsy by nbonamy for a while and I must say, it's an incredibly intuitive tool. Its seamless integration and user-friendly interface make it a breeze to navigate. Highly recommend it for anyone looking for efficiency and reliability in their workflows!

user_cFdA9Fa1
I've been using witsy by nbonamy and it's been a game-changer! The intuitive design and seamless integration truly stand out. Highly recommend it for anyone looking to enhance their productivity.