🗣 Java Text to Speech (JSAPI2) engines (google cloud, cocoa, open jtalk, aquestalk(ゆっくり), voicevox(ずんだもん), coeiroink, aivisspeech, google genai, qwen3-tts)

umjammer/vavi-speech2

vavi-speech2

yukkuries

Text to Speech and Speech to Text (JSAPI2) engines for Java

| Type | Description | Synthesizer | Recognizer | Quality | Comment |
|------|-------------|-------------|------------|---------|---------|
| AquesTalk10 | AquesTalk, JNA | ✅ | - | 😐 | ゆっくり |
| Google Cloud Text To Speech | Google Cloud Text To Speech, Library | ✅ | 🚧 | 👑 | |
| Cocoa | Rococoa, JNA | ✅ | 🚫 | 😐 | |
| Open JTalk | jtalkdll, JNA | ✅ | - | 💩 | |
| VoiceVox | VOICEVOX, REST | ✅ | - | 😃 | ずんだもん |
| CoeiroInk | CoeiroInk, REST | ✅ | - | 😃 | つくよみちゃん |
| Gyutan (Open JTalk in Java) | Gyutan, Library | ✅ | - | 💩 | |
| AivisSpeech | Aivis Project, REST | ✅ | - | 👑 | |
| Google AI Studio | Google Gemini API, Library | ✅ | - | 🚀 | |
| Qwen3-TTS | OpenAI API, Library | ✅ | - | 💑 | voice cloning! |

Install

maven

AquesTalk10

  • place AquesTalk10.framework into ~/Library/Frameworks
  • create symbolic link AquesTalk10.framework/AquesTalk as AquesTalk10.framework/AquesTalk10
  • write aquesTalk10DevKey into local.properties

Google Cloud Text To Speech

  • get the credential token as a JSON file
  • set the system property vavi.speech.googlecloud.credential to the path of that JSON file
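
The property can be passed on the command line (`-Dvavi.speech.googlecloud.credential=/path/to/key.json`) or set in code. Below is a minimal sketch of the second option. The class and method names are hypothetical and not part of this library, and it assumes the property is read when the engine is created, so it should be set before any synthesizer is requested.

```java
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper, not part of vavi-speech2.
final class GoogleCloudCredentialSetup {

    // Property name taken from the install notes above.
    static final String CREDENTIAL_PROPERTY = "vavi.speech.googlecloud.credential";

    /** Points the property at the JSON key file, failing early if the file cannot be read. */
    static void useCredential(Path jsonKey) {
        if (!Files.isReadable(jsonKey)) {
            throw new IllegalArgumentException("credential file is not readable: " + jsonKey);
        }
        System.setProperty(CREDENTIAL_PROPERTY, jsonKey.toAbsolutePath().toString());
    }

    public static void main(String[] args) {
        if (args.length != 1) {
            System.out.println("usage: GoogleCloudCredentialSetup <path-to-json>");
            return;
        }
        useCredential(Path.of(args[0]));
        System.out.println(CREDENTIAL_PROPERTY + "=" + System.getProperty(CREDENTIAL_PROPERTY));
    }
}
```

Checking readability up front turns a wrong path into an immediate, readable error instead of a failure deep inside the cloud client.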

Open JTalk

VOICEVOX

  • download the application
  • run the application before using this library
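
Because the REST-based engines need their application running first, a quick check that something is listening on the expected port can save a confusing failure later. The sketch below only opens a TCP connection; it does not call any engine API. The class name is hypothetical, and the VOICEVOX URL and property name are the defaults listed under "system property" below.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.Socket;
import java.net.URI;

// Hypothetical helper, not part of vavi-speech2.
final class EngineReachability {

    /** Returns true if a TCP connection to the URL's host and port succeeds within the timeout. */
    static boolean isListening(String baseUrl, int timeoutMillis) {
        URI uri = URI.create(baseUrl);
        // Fall back to the scheme's usual port when the URL does not name one.
        int port = uri.getPort() != -1
                ? uri.getPort()
                : ("https".equalsIgnoreCase(uri.getScheme()) ? 443 : 80);
        try (Socket socket = new Socket()) {
            socket.connect(new InetSocketAddress(uri.getHost(), port), timeoutMillis);
            return true;
        } catch (IOException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        String url = System.getProperty("vavi.speech.voicevox.url", "http://localhost:50021");
        System.out.println(url + (isListening(url, 1000) ? " is accepting connections" : " is not reachable"));
    }
}
```

The same check applies to COEIROINK, AivisSpeech and the Qwen3-TTS server by passing their respective URLs.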

COEIROINK

  • download the application
  • run the application before using this library

DoCoMo AI Agent API (wip)

AivisSpeech

  • download the application
  • run the application before using this library

Google Gemini API (Google AI Studio)

  • get api key
  • set the environment variable GOOGLE_API_KEY to the API key
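
An environment variable cannot be set from inside a running JVM, so it has to be exported before the application starts. A small pre-flight check like the following sketch can make a missing key obvious. The class and method names are hypothetical; only the variable name GOOGLE_API_KEY comes from the notes above.

```java
import java.util.Map;

// Hypothetical helper, not part of vavi-speech2.
final class GeminiApiKeyCheck {

    /** Returns the API key from the given environment, or throws if it is missing or blank. */
    static String requireApiKey(Map<String, String> env) {
        String key = env.get("GOOGLE_API_KEY");
        if (key == null || key.isBlank()) {
            throw new IllegalStateException("GOOGLE_API_KEY is not set");
        }
        return key;
    }

    public static void main(String[] args) {
        try {
            requireApiKey(System.getenv());
            System.out.println("GOOGLE_API_KEY is set");
        } catch (IllegalStateException e) {
            System.out.println(e.getMessage());
        }
    }
}
```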

Qwen3-TTS

  • install https://github.com/umjammer/Qwen3-TTS-Openai-Fastapi (any server works as long as it exposes an OpenAI-compatible API)
  • run the server before using this library, and make sure its port number matches the configured URL
  • the default URL is http://localhost:50090; it can be changed with the system property vavi.speech.qwen3tts.url

Usage

system property

  • vavi.speech.voicevox.url ... the VOICEVOX api server url, default is http://localhost:50021.
  • vavi.speech.coeiroink.url ... the COEIROINK api server url, default is http://localhost:50032.
  • vavi.speech.aivis.url ... the AivisSpeech api server url, default is http://localhost:10101.
  • vavi.speech.qwen3tts.url ... the Qwen3-TTS api server url, default is http://localhost:50090.
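
Each URL can be overridden with a `-D` option, for example `-Dvavi.speech.voicevox.url=http://192.168.0.10:50021` (the address here is only an illustration). The sketch below shows the usual property-with-default lookup for the four properties above. The class is hypothetical and is not the library's own lookup code; it is only meant to show which URL would be in effect for a given JVM.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of vavi-speech2.
final class EngineUrls {

    /** Property name to default URL, as listed above. */
    static final Map<String, String> DEFAULTS = new LinkedHashMap<>();

    static {
        DEFAULTS.put("vavi.speech.voicevox.url", "http://localhost:50021");
        DEFAULTS.put("vavi.speech.coeiroink.url", "http://localhost:50032");
        DEFAULTS.put("vavi.speech.aivis.url", "http://localhost:10101");
        DEFAULTS.put("vavi.speech.qwen3tts.url", "http://localhost:50090");
    }

    /** Returns the system property value if set, otherwise the documented default. */
    static String resolve(String property) {
        String fallback = DEFAULTS.get(property);
        if (fallback == null) {
            throw new IllegalArgumentException("unknown property: " + property);
        }
        return System.getProperty(property, fallback);
    }

    public static void main(String[] args) {
        // Print the URL each engine would use in this JVM.
        DEFAULTS.keySet().forEach(p -> System.out.println(p + " = " + resolve(p)));
    }
}
```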

system property (qwen3-tts specific)

  • vavi.speech.qwen3tts.clone ... whether to use a cloned voice, default is false.
  • vavi.speech.qwen3tts.refAudio ... when using a cloned voice, the path of the reference audio file. (only WAV files have been tested)
  • vavi.speech.qwen3tts.refText ... when using a cloned voice, the reference text. (the transcription of the audio above)
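
The three cloning properties belong together, so it can help to set them in one place. This is a minimal sketch with a hypothetical class name. It assumes the library accepts the string "true" for the clone flag and reads the properties when the synthesizer is created; neither is confirmed by the notes above.

```java
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper, not part of vavi-speech2.
final class Qwen3CloneSettings {

    /** Enables voice cloning and records the reference audio and its transcription. */
    static void enableClone(Path refAudio, String refText) {
        if (!Files.isRegularFile(refAudio)) {
            throw new IllegalArgumentException("reference audio not found: " + refAudio);
        }
        if (refText == null || refText.isBlank()) {
            throw new IllegalArgumentException("reference text must not be empty");
        }
        // Assumption: "true" is the value the library expects for the flag.
        System.setProperty("vavi.speech.qwen3tts.clone", "true");
        System.setProperty("vavi.speech.qwen3tts.refAudio", refAudio.toAbsolutePath().toString());
        System.setProperty("vavi.speech.qwen3tts.refText", refText);
    }

    public static void main(String[] args) {
        if (args.length != 2) {
            System.out.println("usage: Qwen3CloneSettings <reference.wav> <transcription>");
            return;
        }
        enableClone(Path.of(args[0]), args[1]);
        System.out.println("clone voice enabled with " + System.getProperty("vavi.speech.qwen3tts.refAudio"));
    }
}
```

The same values can be given as `-D` options instead; the helper only adds the early checks on the file and the text.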

user

Reference

TODO


images by 霊夢, 魔理沙, ずんだもん
