Winky is a cross-platform desktop voice assistant that helps you quickly convert voice to text and run intelligent LLM-powered actions. With a convenient floating microphone overlay, you can interact with Winky from anywhere on your screen, making it perfect for productivity workflows.
Winky supports both cloud-based and local AI processing, giving you the flexibility to choose between speed and privacy. Whether you need quick voice commands, transcription, or AI-powered responses, Winky is ready to help.
- Extension repository: https://github.com/Artasov/winky-ext
- Chrome Web Store: https://chromewebstore.google.com/detail/winky/mpinlhhkmpljjlcekiocnglfbfpamkjl
This repository contains the source code for Winky, a cross-platform desktop application built with Tauri + React + Vite. The application provides a modern, efficient voice assistant experience with support for multiple AI providers and local processing options.
- FREE USAGE - no subscription required, no limits for local processing
- Voice Recognition - advanced speech-to-text conversion with multiple AI models
- LLM Processing - intelligent AI-powered actions and responses
- Floating Microphone - convenient floating microphone overlay for quick access
- Local Speech Recognition - use local AI models for faster processing and privacy
- Quick Actions - customizable hotkeys and actions for productivity
- Privacy & Security - all data processed locally, audio is not stored
- Cross-platform - works on Windows, macOS and Linux
- Simple interface - intuitive and easy to use
- Customizable - configure transcription models, LLM providers, and actions
If you run into any issues using the app, please open an issue on GitHub.
- Open the Winky application
- Complete the initial setup wizard:
  - Sign in with your account (OAuth authentication)
  - Configure your API keys:
    - OpenAI API key (get it from platform.openai.com)
    - Google AI API key (get it from console.cloud.google.com)
  - Choose your speech recognition mode:
    - Cloud - use cloud-based transcription (OpenAI Whisper, Google AI)
    - Local - use local fast-whisper for privacy and speed
  - Configure LLM settings:
    - Choose your preferred LLM provider
    - Select the model suitable for your needs
  - Set up quick actions:
    - Configure custom hotkeys for actions
    - Create and customize your action workflows
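The Cloud vs. Local choice in the wizard is essentially a routing decision: cloud mode sends audio to a hosted API, local mode keeps it on your machine. The sketch below illustrates that trade-off; the local URL, port, and model label are assumptions for illustration only (only the OpenAI endpoint and the `whisper-1` model name are real), and it does not reflect Winky's actual internals.

```typescript
type SpeechMode = "cloud" | "local";

interface TranscriptionTarget {
  url: string;
  model: string;
  sendsAudioOffDevice: boolean;
}

// Hypothetical sketch of the cloud/local routing decision.
function selectTranscriptionTarget(mode: SpeechMode): TranscriptionTarget {
  if (mode === "cloud") {
    return {
      url: "https://api.openai.com/v1/audio/transcriptions", // OpenAI's hosted Whisper endpoint
      model: "whisper-1",
      sendsAudioOffDevice: true, // audio leaves the machine
    };
  }
  return {
    url: "http://127.0.0.1:8765/transcribe", // assumed local fast-whisper server address
    model: "fast-whisper",
    sendsAudioOffDevice: false, // audio stays on-device
  };
}
```

In practice this is why local mode is the better fit for privacy-sensitive work, while cloud mode trades that for hosted-model quality.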
- Use the floating microphone overlay to start voice recognition
- Speak your command or question
- Get instant AI-powered responses and actions
- Use hotkeys for quick access to common actions
- Access your profile, actions, and settings from the main window
- Position the floating microphone overlay where it's convenient for you
- Customize hotkeys to match your workflow
- Use local speech recognition for better privacy
- Practice with different commands to get the best results
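To make the hotkey-plus-action pairing concrete, here is a hypothetical quick-action definition. The field names, the `{{input}}` placeholder, and the helper function are illustrative assumptions; Winky's real settings schema may differ.

```typescript
// Hypothetical shape of a quick action: a global hotkey tied to an LLM prompt template.
interface QuickAction {
  name: string;
  hotkey: string; // global shortcut that triggers the action
  prompt: string; // prompt template; {{input}} stands in for the voice transcript
}

const summarizeClipboard: QuickAction = {
  name: "Summarize clipboard",
  hotkey: "Ctrl+Shift+S",
  prompt: "Summarize the following text in three bullet points:\n{{input}}",
};

// Fill the template with a transcript before sending it to the LLM.
function fillPrompt(action: QuickAction, input: string): string {
  return action.prompt.replace("{{input}}", input);
}
```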
The steps below were implemented and tested on Windows 11; they may differ on other systems.
- In the Winky settings, select `Mode -> Speech Recognition` = `Local`.
- In the Winky settings, choose one of the `Model -> Speech Recognition` options.
- In the Winky settings, choose the `Local transcription device`: `GPU` (Graphics/NVIDIA) or `CPU` (Processor).
The local speech recognition server will be automatically installed and managed by Winky.
Minimum recommended configuration:
- CPU - 4 cores / 8 threads
- GPU - 6 GB VRAM
- RAM - 16 GB
- In the Winky settings, select `Mode -> LLM` = `Local`.
- In the Winky settings, choose a `Model -> LLM` from the available models (Ollama models).
- Pull the model you selected: `ollama pull <model-name>`
- Make sure the Ollama server is running: `ollama serve`
The first run after launching the app will be slower, because in local mode the AI models must first be loaded into GPU memory or RAM, which takes time. Do a test run before important tasks so that subsequent calls are faster.
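Under the hood, a locally running Ollama server exposes a REST API on port 11434, and `POST /api/generate` with `stream: false` returns the full completion in one JSON object. The sketch below uses Ollama's documented defaults, but whether Winky talks to Ollama this way is an assumption here.

```typescript
// Minimal sketch of querying a local Ollama server (assumed integration, not Winky's code).
interface OllamaGenerateRequest {
  model: string;   // a model previously fetched with `ollama pull`
  prompt: string;  // e.g. the transcribed voice command
  stream: boolean; // false = return the whole response as a single JSON object
}

function buildGenerateRequest(model: string, prompt: string): OllamaGenerateRequest {
  return { model, prompt, stream: false };
}

async function generate(model: string, prompt: string): Promise<string> {
  const res = await fetch("http://127.0.0.1:11434/api/generate", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify(buildGenerateRequest(model, prompt)),
  });
  const data = await res.json();
  return data.response; // Ollama puts the full generated text in `response` when stream is false
}
```

Because everything runs on localhost, no API key is needed and no text leaves the machine.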
We welcome contributions to the project! If you want to contribute:
- Fork the repository
- Create a feature branch (`git checkout -b feature/amazing-feature`)
- Commit your changes (`git commit -m 'Add amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request
See CONTRIBUTING.md for detailed guidelines.
- Node.js 20+ (LTS)
- Rust 1.80+ (for building Tauri)
- npm or yarn
```shell
# Clone the repository
git clone https://github.com/placeholder/winky.git
cd winky

# Install dependencies
npm install

# Build the project
npm run build

# Run in development mode
npm run dev
```

```
src/
├── renderer/       # React renderer process (UI)
│   ├── app/        # Application logic and hooks
│   ├── components/ # React components
│   ├── context/    # React context providers
│   ├── features/   # Feature modules
│   ├── services/   # API and service layer
│   ├── windows/    # Window components
│   └── ...
├── shared/         # Shared types and utilities
└── ...
src-tauri/
├── src/            # Rust backend (Tauri)
└── ...
```
- `npm run dev` - run in development mode
- `npm run build` - build the project
- `npm run build:renderer` - build only the renderer (frontend)
- `npm run dev:renderer` - run the renderer dev server only
- `npm run lint` - check TypeScript types
- `npm run typecheck` - same as lint
- `npm run preview` - preview the built frontend
On Windows, `npm run build` creates:
- A portable executable in `src-tauri/target/release/`
On macOS, `npm run build` creates:
- A DMG archive for Intel and Apple Silicon

Note: for macOS builds, you may need to:
- Install the Xcode Command Line Tools: `xcode-select --install`
On Linux, `npm run build` creates:
- A portable directory in `src-tauri/target/release/`
- Tauri - cross-platform desktop application framework
- React - UI library
- TypeScript - typed JavaScript
- Tailwind CSS - utility-first CSS framework
- Vite - build tool and dev server
- OpenAI API - AI integration
- Google AI API - AI integration
Made with ❤️ for productivity and assistance
