JavaFAQ Home » Sound, multimedia
I tried to look at the classes for Java Speech technology... They are not
included into JDK. Even more, it seems SUN has no implementation for it. How we
can use it if there are no classes?
Yes, that's right. Sun does not ship an implementation of JSAPI. Instead, it
works with third party speech companies.
JSAPI (Java Speech API) allows Java applications to incorporate speech
technology into their user interfaces. It defines a cross-platform API to
support command and control recognizers, dictation systems and speech
JSAPI includes the Javadoc-style API documentation for the approximately 70
classes and interfaces in the API. The specification also includes a detailed
Programmer's Guide which explains both introductory and advanced speech
application programming with JSAPI. Two companion specifications are available:
JSML and JSGF: the Java Speech API Markup Language (JSML) and the Java Speech
API Grammar Format (JSGF) are companion specifications to the Java Speech API.
JSML (currently in beta) defines a standard text format for marking up text for
input to a speech synthesizer. JSGF version 1.0 defines a standard text format
for providing a grammar to a speech recognizer. Both specifications are
The Java Speech API is a freely available specification and therefore anyone is
welcome to develop an implementation. The following implementations are known to
FreeTTS on SourceForge.net
- Description: Open source speech
synthesizer written entirely in the Java programming language.
- Requirements: JDK 1.4. Read about more requirements on the
FreeTTS web site.
IBM's "Speech for Java"
- Description: Implementation based on
IBM's ViaVoice product, which supports continuous dictation, command and
control and speech synthesis. It supports all the European language versions
of ViaVoice -- US & UK English, French, German, Italian and Spanish -- plus
- Requirements: JDK 1.1.7 or later or JDK 1.2 on Windows 95 with
32MB, or Windows NT with 48MB. Both platforms also require an installation
IBM's "Speech for Java" on Linux
- Description: Beta version of "Speech
for Java" on Linux. Currently only supports speech recognition.
- Requirements: RedHat Linux 6.0 with 32MB, and
Blackdown JDK 1.1.7 with native thread support.
- Description: Implementation for use
with any recognition/TTS speech engine compliant with Microsoft's SAPI5 (with
SAPI4 support for TTS engines only). An additional package allows redirection
of audio data to/from Files, Lines and remote clients (using the
javax.sound.sampled package). Some examples demonstrate its use in applets in
NetscapeTM and IE browsers.
- Requirements: JDKTM 1.1 or
better, Windows 98, Me, 2000 or NT, and any SAPI 5.1, 5.0 or 4.0 compliant
speech engine (some of which can be downloaded from Microsoft's web site).
Hauspie's TTS for Java Speech API
- Description: Implementations based
upon ASR1600 and TTS3000 engines, which support command and control and speech
synthesis. Supports 10 different voices and associated whispering voices for
the English language. Provides control for pitch, pitch range, speaking rate,
- Requirements: Sun Solaris OS version 2.4 or later, JDK 1.1.5. Sun
Swing package (free download) for graphical Type-n-Talk demo.
- More information: Contact Edmund Kwan (mailto: ekwan at
lhs.com) , Director of Sales, Western Region Speech and Language
Technologies and Solutions (ekwan at lhs.com)
Conversa Web 3.0
- Description: Conversa Web is a
voice-enabled Web browser that provides a range of facilities for
voice-navigation of the web by speech recognition and text-to-speech. The
developers of Conversa Web chose to write a JSAPI implementation for the
- Requirements: Windows 95/98 or NT 4.0 running on Intel Pentium
166 MHz processor or faster (or equivalent). Minimum of 32 MB RAM (64 MB
recommended). Multimedia system: sound card and speakers. Microsoft Internet
Explorer 4.0 or higher.
- Description: Festival is a general
multi-lingual speech synthesis system developed by the Centre for Speech
Technology Research at the University of Edinburgh. It offers a full text to
speech system with various APIs, as well an environment for development and
research of speech synthesis techniques. It is written in C++ with a
Scheme-based command interpreter for general control and provides a binding to
the Java Speech API. Supports the English (British and American), Spanish and
- Requirements: Festival runs on Suns (SunOS and Solaris), FreeBSD,
Linux, SGIs, HPs and DEC Alphas and is portable to other Unix machines.
Preliminary support is available for Windows 95 and NT. For details and
requirements see the
Festival download page.
- Description: Elan Speech Cube is a
Multilingual, multichannel, cross-operating system text-to-speech software
component for client-server architecture. Speech Cube is available with 2 TTS
technologies (Elan Tempo : diphone concatenation and Elan Sayso : unit
selection), covering 11 languages. Speech Cube native Java client supports
- Requirements: JDK 1.3 or later on Windows NT/2000/XP, Linux or
Solaris 2.7/2.8, Speech Cube V4.2 and higher.
- About Elan Speech: Elan Speech is an established worldwide
provider of text-to-speech technology (TTS). Elan TTS transforms any IT
generated text into speech and reads it out loud.
Tip based on JSAPI FAQ
Our older tips: March 22, 2001 - October 21, 2002
All published and not published on the site tips read
Printer Friendly Page
Send to a Friend
Search here again if you need more info!