|
|
|
/***************************************************** vim:set ts=4 sw=4 sts=4:
|
|
|
|
kspeech.h
|
|
|
|
KTTSD DCOP Interface
|
|
|
|
--------------------
|
|
|
|
Copyright:
|
|
|
|
(C) 2002-2003 by José Pablo Ezequiel "Pupeno" Fernández <pupeno@kde.org>
|
|
|
|
(C) 2003-2004 by Olaf Schmidt <ojschmidt@kde.org>
|
|
|
|
(C) 2004-2005 by Gary Cramblitt <garycramblitt@comcast.net>
|
|
|
|
-------------------
|
|
|
|
Original author: José Pablo Ezequiel "Pupeno" Fernández
|
|
|
|
******************************************************************************/
|
|
|
|
|
|
|
|
/***************************************************************************
|
|
|
|
* *
|
|
|
|
* This program is free software; you can redistribute it and/or modify *
|
|
|
|
* it under the terms of the GNU General Public License as published by *
|
|
|
|
* the Free Software Foundation; version 2 of the License. *
|
|
|
|
* *
|
|
|
|
***************************************************************************/
|
|
|
|
|
|
|
|
#ifndef _KSPEECH_H_
|
|
|
|
#define _KSPEECH_H_
|
|
|
|
|
|
|
|
#include <dcopobject.h>
|
|
|
|
#include <tqstringlist.h>
|
|
|
|
|
|
|
|
/**
|
|
|
|
* @interface KSpeech
|
|
|
|
*
|
|
|
|
* kspeech - the KDE Text-to-Speech API.
|
|
|
|
*
|
|
|
|
* @version 1.0 Draft 10
|
|
|
|
*
|
|
|
|
* @since KDE 3.4
|
|
|
|
*
|
|
|
|
* This class defines the DCOP interface for applications desiring to speak text.
|
|
|
|
* Applications may speak text by sending DCOP messages to application "kttsd" object "KSpeech".
|
|
|
|
*
|
|
|
|
* %KTTSD -- the KDE Text-to-Speech Deamon -- is the program that supplies the services
|
|
|
|
* in the KDE Text-to-Speech API.
|
|
|
|
*
|
|
|
|
* @warning The KSpeech interface is still being developed and is likely to change in the future.
|
|
|
|
*
|
|
|
|
* @section Features
|
|
|
|
*
|
|
|
|
* - Priority system for Screen Readers, warnings and messages, while still playing
|
|
|
|
* regular texts.
|
|
|
|
* - Long text is parsed into sentences. User may backup by sentence or part,
|
|
|
|
* replay, pause, and stop playing.
|
|
|
|
* - Handles multiple speaking applications. Text messages are treated like print jobs.
|
|
|
|
* Jobs may be created, started, stopped, paused, resumed, and deleted.
|
|
|
|
* - Speak contents of clipboard.
|
|
|
|
* - Speak KDE notifications.
|
|
|
|
* - Plugin-based text job filtering permits substitution for misspoken words,
|
|
|
|
* abbreviations, etc., transformation of XML or XHTML to SSML, and automatic
|
|
|
|
* choice of appropriate synthesis engine.
|
|
|
|
*
|
|
|
|
* @section Requirements
|
|
|
|
*
|
|
|
|
* You may build any KDE application to use KSpeech, since the interface is in tdelibs, but
|
|
|
|
* the tdeaccessibility package must be installed for KTTS to function.
|
|
|
|
*
|
|
|
|
* You will need a speech synthesis engine, such as Festival. See the KTTS Handbook
|
|
|
|
* for the latest information on installing and configuring speech engines and voices
|
|
|
|
* with KTTS.
|
|
|
|
*
|
|
|
|
* @section goals Design Goals
|
|
|
|
*
|
|
|
|
* The KDE Text-to-Speech API is designed with the following goals:
|
|
|
|
*
|
|
|
|
* - Support the features enumerated above.
|
|
|
|
* - Plugin-based architecture for support of a wide variety of speech synthesis
|
|
|
|
* engines and drivers.
|
|
|
|
* - Permit generation of speech from the command line (or via shell scripts)
|
|
|
|
* using the KDE DCOP utilities.
|
|
|
|
* - Provide a lightweight and easily usable interface for applications to
|
|
|
|
* generate speech output.
|
|
|
|
* - Applications need not be concerned about contention over the speech device.
|
|
|
|
* - Provide limited support for speech markup languages, such as Sable,
|
|
|
|
* Java %Speech Markup Language (JSML), and %Speech Markup Meta-language (SMML).
|
|
|
|
* - Provide limited support for embedded speech markers.
|
|
|
|
* - Asynchronous to prevent system blocking.
|
|
|
|
* - Plugin-based audio architecture. Currently supports aRts but will support
|
|
|
|
* additional audio engines in the future, such as gstreamer.
|
|
|
|
* - Compatible with original %KTTSD API as developed by José Pablo Ezequiel
|
|
|
|
* "Pupeno" Fernández (avoid breaking existing applications).
|
|
|
|
*
|
|
|
|
* Architecturally, applications interface with %KTTSD, which performs queueing,
|
|
|
|
* speech job managment, plugin management and sentence parsing. %KTTSD interfaces with a
|
|
|
|
* %KTTSD speech plugin(s), which then interfaces with the speech engine(s) or driver(s).
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
application
|
|
|
|
^
|
|
|
|
| via DCOP (the KDE Text-to-Speech API)
|
|
|
|
v
|
|
|
|
kttsd
|
|
|
|
^
|
|
|
|
| KTTSD plugin API
|
|
|
|
v
|
|
|
|
kttsd plugin
|
|
|
|
^
|
|
|
|
|
|
|
|
|
v
|
|
|
|
speech engine
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* The %KTTSD Plugin API is documented in PluginConf in the tdeaccessibility module.
|
|
|
|
*
|
|
|
|
* There is a separate GUI application, called kttsmgr, for providing %KTTSD
|
|
|
|
* configuration and job management.
|
|
|
|
*
|
|
|
|
* kttsd maintains 4 types of speech output:
|
|
|
|
* - Screen Reader Output
|
|
|
|
* - Warnings
|
|
|
|
* - Messages
|
|
|
|
* - Text Jobs
|
|
|
|
*
|
|
|
|
* Method sayScreenReaderOutput speaks Screen Reader output.
|
|
|
|
* It pre-empts any other speech in progress,
|
|
|
|
* including other Screen Reader outputs, i.e., it is not a queue.
|
|
|
|
* This method is reserved for use by Screen Readers.
|
|
|
|
*
|
|
|
|
* Methods sayWarning and sayMessage place messages into the Warnings and
|
|
|
|
* Messages queues respectively. Warnings take priority over messages, which take priority
|
|
|
|
* over text jobs. Warnings and messages are spoken when the currently-speaking
|
|
|
|
* sentence of a text job is finished.
|
|
|
|
*
|
|
|
|
* setText places text into the text job queue. startText begins speaking jobs.
|
|
|
|
* When one job finishes, the next job begins. Method appendText adds
|
|
|
|
* additional parts to a text job. Within a text job, the application (and user
|
|
|
|
* via the kttsmgr GUI), may back up or advance by sentence or part, or rewind
|
|
|
|
* to the beginning.
|
|
|
|
* See jumpToTextPart and moveRelTextSentence.
|
|
|
|
* Text jobs may be paused, stopped, and resumed or deleted from the queue.
|
|
|
|
* See pauseText, stopText, resumeText, and removeText.
|
|
|
|
*
|
|
|
|
* @section cmdline DCOP Command-line Interface
|
|
|
|
*
|
|
|
|
* To create a text job to be spoken
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
dcop kttsd KSpeech setText <text> <talker>
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* where \<text\> is the text to be spoken, and \<talker\> is usually a language code
|
|
|
|
* such as "en", "cy", etc.
|
|
|
|
*
|
|
|
|
* Example.
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
dcop kttsd KSpeech setText "This is a test." "en"
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* To start speaking the text.
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
dcop kttsd KSpeech startText 0
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* You can combine the setText and startText commands into a single command.
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
dcop kttsd KSpeech sayText <text> <talker>
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* @since KDE 3.5
|
|
|
|
*
|
|
|
|
* To stop speaking and rewind to the beginning of the text.
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
dcop kttsd KSpeech stopText 0
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* Depending upon the speech plugin used, speaking may not immediately stop.
|
|
|
|
*
|
|
|
|
* To stop and remove a text job.
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
dcop kttsd KSpeech removeText 0
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* Note: For more information about talker codes, see talkers below.
|
|
|
|
*
|
|
|
|
* @section programming Calling KTTSD from a Program
|
|
|
|
*
|
|
|
|
* There are two methods of making DCOP calls from your application to %KTTSD.
|
|
|
|
*
|
|
|
|
* - Manually code them using dcopClient object. See tdebase/konqueror/kttsplugin/tdehtmlkttsd.cpp
|
|
|
|
* for an example. This method is recommended if you want to make a few simple calls to KTTSD.
|
|
|
|
* - Use kspeech_stub as described below. This method generates the marshalling code for you
|
|
|
|
* and is recommended for a more complex speech-enabled applications. kcmkttsmgr in the
|
|
|
|
* tdeaccessibility module is an example that uses this method.
|
|
|
|
*
|
|
|
|
* To make DCOP calls from your program using kspeech_stub, follow these steps:
|
|
|
|
*
|
|
|
|
* 1. Include kspeech_stub.h in your code. Derive an object from the KSpeech_stub interface.
|
|
|
|
* For example, suppose you are developing a KPart and want to call %KTTSD.
|
|
|
|
* Your class declaration might look like this:
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
#include <kspeech_stub.h>
|
|
|
|
class MyPart: public KParts::ReadOnlyPart, public KSpeech_stub {
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* 2. In your class constructor, initialize DCOPStub, giving it the sender
|
|
|
|
* "kttsd", object "KSpeech".
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
MyPart::MyPart(TQWidget *parent, const char *name) :
|
|
|
|
KParts::ReadOnlyPart(parent, name),
|
|
|
|
DCOPStub("kttsd", "KSpeech") {
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* 3. See if KTTSD is running, and if not, start it.
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
DCOPClient *client = dcopClient();
|
|
|
|
client->attach();
|
|
|
|
if (!client->isApplicationRegistered("kttsd")) {
|
|
|
|
TQString error;
|
|
|
|
if (TDEApplication::startServiceByDesktopName("kttsd", TQStringList(), &error))
|
|
|
|
cout << "Starting KTTSD failed with message " << error << endl;
|
|
|
|
}
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* If you want to detect if KTTSD is installed without starting it, use this code.
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
TDETrader::OfferList offers = TDETrader::self()->query("DCOP/Text-to-Speech", "Name == 'KTTSD'");
|
|
|
|
if (offers.count() > 0)
|
|
|
|
{
|
|
|
|
// KTTSD is installed.
|
|
|
|
}
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* Typically, you would do this to hide a menu item or button if KTTSD is not installed.
|
|
|
|
*
|
|
|
|
* 4. Make calls to KTTSD in your code.
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
uint jobNum = setText("Hello World", "en");
|
|
|
|
startText(jobNum);
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* 4. Add kspeech_DIR and kspeech.stub to your Makefile.am. Example:
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
kspeech_DIR = $(kde_includes)
|
|
|
|
libmypart_la_SOURCES = kspeech.stub
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* @section signals Signals Emitted by KTTSD
|
|
|
|
*
|
|
|
|
* %KTTSD emits a number of DCOP signals, which provide information about sentences spoken,
|
|
|
|
* text jobs started, stopped, paused, resumed, finished, or deleted and markers seen.
|
|
|
|
* In general, these signals are broadcast to any application that connects to them.
|
|
|
|
* Applications should check the appId argument to determine whether the signal belongs to
|
|
|
|
* them or not.
|
|
|
|
*
|
|
|
|
* To receive %KTTSD DCOP signals, follow these steps:
|
|
|
|
*
|
|
|
|
* 1. Include kspeechsink.h in your code. Derive an object from the KSpeechSink interface
|
|
|
|
* and declare a method for each signal you'd like to receive. For example,
|
|
|
|
* if you were coding a KPart and wanted to receive the KTTSD signal sentenceStarted:
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
#include <kspeechsink.h>
|
|
|
|
class MyPart:
|
|
|
|
public KParts::ReadOnlyPart,
|
|
|
|
virtual public KSpeechSink
|
|
|
|
{
|
|
|
|
protected:
|
|
|
|
ASYNC sentenceStarted(const TQCString& appId, const uint jobNum, const uint seq);
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* You can combine sending and receiving in one object.
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
#include <kspeechsink.h>
|
|
|
|
class MyPart:
|
|
|
|
public KParts::ReadOnlyPart,
|
|
|
|
public KSpeech_stub,
|
|
|
|
virtual public KSpeechSink
|
|
|
|
{
|
|
|
|
protected:
|
|
|
|
ASYNC sentenceStarted(const TQCString& appId, const uint jobNum, const uint seq);
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* See below for the signals you can declare.
|
|
|
|
*
|
|
|
|
* 2. In your class constructor, initialize DCOPObject with the name of your DCOP
|
|
|
|
* receiving object.
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
MyPart::MyPart(TQWidget *parent, const char *name) :
|
|
|
|
KParts::ReadOnlyPart(parent, name),
|
|
|
|
DCOPObject("mypart_kspeechsink") {
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* Use any name you like.
|
|
|
|
*
|
|
|
|
* 3. Where appropriate (usually in your constructor), make sure your DCOPClient
|
|
|
|
* is registered and connect the %KTTSD DCOP signals to your declared receiving
|
|
|
|
* methods.
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
// Register DCOP client.
|
|
|
|
DCOPClient *client = kapp->dcopClient();
|
|
|
|
if (!client->isRegistered())
|
|
|
|
{
|
|
|
|
client->attach();
|
|
|
|
client->registerAs(kapp->name());
|
|
|
|
}
|
|
|
|
// Connect KTTSD DCOP signals to our slots.
|
|
|
|
connectDCOPSignal("kttsd", "KSpeech",
|
|
|
|
"sentenceStarted(TQCString,uint,uint)",
|
|
|
|
"sentenceStarted(TQCString,uint,uint)",
|
|
|
|
false);
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* Notice that the argument signatures differ slightly from the actual declarations. For
|
|
|
|
* example
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
ASYNC sentenceStarted(const TQCString& appId, const uint jobNum, const uint seq);
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* becomes
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
"sentenceStarted(TQCString,uint,uint)",
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* in the connectDCOPSignal call.
|
|
|
|
*
|
|
|
|
* 4. Write the definition for the received signal. Be sure to check whether the signal
|
|
|
|
* is intended for your application.
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
ASYNC MyPart::sentenceStarted(const TQCString& appId, const uint jobNum, const uint seq)
|
|
|
|
{
|
|
|
|
// Check appId to determine if this is our signal.
|
|
|
|
if (appId != dcopClient()->appId()) return;
|
|
|
|
// Do something here.
|
|
|
|
}
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* 5. Add kspeechsink_DIR and kspeechsink.skel to your Makefile.am. Example for an app
|
|
|
|
* both sending and receiving.
|
|
|
|
*
|
|
|
|
@verbatim
|
|
|
|
kspeech_DIR = $(kde_includes)
|
|
|
|
kspeechsink_DIR = $(kde_includes)
|
|
|
|
libmypart_la_SOURCES = kspeech.stub kspeechsink.skel
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* @section talkers Talkers, Talker Codes, and Plugins
|
|
|
|
*
|
|
|
|
* Many of the methods permit you to specify a desired "talker". This
|
|
|
|
* may be a simple language code, such as "en" for English, "es" for Spanish, etc.
|
|
|
|
* Code as NULL to use the default configured talker.
|
|
|
|
*
|
|
|
|
* Within KTTSMGR, the user has the ability to configure more than one talker for each language,
|
|
|
|
* with different voices, genders, volumes, and talking speeds.
|
|
|
|
*
|
|
|
|
* Talker codes serve two functions:
|
|
|
|
* - They identify configured plugins, and
|
|
|
|
* - They provide a way for applications to specify the desired speaking attributes
|
|
|
|
* that influence the choice of plugin to speak text.
|
|
|
|
*
|
|
|
|
* A Talker Code consists of a series of XML tags and attributes.
|
|
|
|
* An example of a full Talker Code with all attributes specified is
|
|
|
|
* \code
|
|
|
|
* <voice lang="en" name="kal" gender="male"/>
|
|
|
|
* <prosody volume="soft" rate="fast"/>
|
|
|
|
* <kttsd synthesizer="Festival" />
|
|
|
|
* \endcode
|
|
|
|
*
|
|
|
|
* (The @e voice and @e prosody tags are adapted from the W3C Speech Synthesis
|
|
|
|
* Markup Language (SSML) and Java Speech Markup Language (JSML).
|
|
|
|
* The @e kttsd tag is an extension to the SMML and JSML languages to support
|
|
|
|
* named synthesizers and text encodings.)
|
|
|
|
* %KTTS doesn't really care about the @e voice, @e prosody, and @e kttsd tags. In fact,
|
|
|
|
* they may be omitted and just the attributes specified. The example above then
|
|
|
|
* becomes
|
|
|
|
*
|
|
|
|
* lang="en" name="kal" gender="male" volume="soft" rate="fast"
|
|
|
|
* synthesizer="Festival"
|
|
|
|
*
|
|
|
|
* The attributes may be specified in any order.
|
|
|
|
*
|
|
|
|
* For clarity, the rest of the discussion
|
|
|
|
* will omit the @e voice, @e prosody, and @e kttsd tags.
|
|
|
|
*
|
|
|
|
* The attributes that make up a talker code are:
|
|
|
|
*
|
|
|
|
* - @e lang. Language code and optional country code.
|
|
|
|
* Examples: en, es, en_US, en_GB. Codes
|
|
|
|
* are case in-sensitive and hyphen (-) or underscore (_) may be
|
|
|
|
* used to separate the country code from the language code.
|
|
|
|
* - @e synthesizer. The name of the synthesizer (plugin) used to produce the speech.
|
|
|
|
* - @e gender. May be either "male", "female", or "neutral".
|
|
|
|
* - @e name. The name of the voice code.
|
|
|
|
* The choice of voice codes is synthesizer-specific.
|
|
|
|
* - @e volume. May be "loud", "medium", or "quiet". A synonym for "quiet" is
|
|
|
|
* "soft".
|
|
|
|
* - @e rate. May be "fast", "medium", or "slow".
|
|
|
|
*
|
|
|
|
* Each plugin, once it has been configured by a user in kttsmgr, returns a
|
|
|
|
* fully-specified talker code to identify itself. If the plugin supports it,
|
|
|
|
* the user may configure another instance of the plugin with a different set
|
|
|
|
* of attributes. This is the difference between a "plugin" and a "talker".
|
|
|
|
* A talker is a configured instance of a plugin. Each plugin (if it supports it)
|
|
|
|
* may be configured as multiple talkers.
|
|
|
|
*
|
|
|
|
* When the user configures %KTTSD, she configures one or more talkers and then
|
|
|
|
* places them in preferred order, top to bottom in kttsmgr. In effect,
|
|
|
|
* she specifies her preferences for each of the talkers.
|
|
|
|
*
|
|
|
|
* When applications specify a talker code, they need not (and typically do not)
|
|
|
|
* give a full specification. An example of a talker code with only some of the
|
|
|
|
* attributes specified might be
|
|
|
|
*
|
|
|
|
* lang="en" gender="female"
|
|
|
|
*
|
|
|
|
* If the talker code is not in XML attribute format, it assumed to be a @e lang
|
|
|
|
* attribute. So the talker code
|
|
|
|
*
|
|
|
|
* en
|
|
|
|
*
|
|
|
|
* is interpreted as
|
|
|
|
*
|
|
|
|
* lang="en"
|
|
|
|
*
|
|
|
|
* When a program requests a talker code in calls to setText, appendText,
|
|
|
|
* sayMessage, sayWarning, and sayScreenReaderOutput,
|
|
|
|
* %KTTSD tries to match the requested talker code to the closest matching
|
|
|
|
* configured talker.
|
|
|
|
*
|
|
|
|
* The @e lang attribute has highest priority (attempting to speak English with
|
|
|
|
* a Spanish synthesizer would likely be unintelligible). So the language
|
|
|
|
* attribute is said to have "priority".
|
|
|
|
* If an application does not specify a language attribute, a default one will be assumed.
|
|
|
|
* The rest of the attributes are said to be "preferred". If %KTTSD cannot find
|
|
|
|
* a talker with the exact preferred attributes requested, the closest matching
|
|
|
|
* talker will likely still be understandable.
|
|
|
|
*
|
|
|
|
* An application may specify that one or more of the attributes it gives in a talker
|
|
|
|
* code have priority by preceeding each priority attribute with an asterisk.
|
|
|
|
* For example, the following talker code
|
|
|
|
*
|
|
|
|
* lang="en" gender="*female" volume="soft"
|
|
|
|
*
|
|
|
|
* means that the application wants to use a talker that supports American English language
|
|
|
|
* and Female gender. If there is more than one such talker, one that supports
|
|
|
|
* Soft volume would be preferred. Notice that a talker configured as English, Male,
|
|
|
|
* and Soft volume would not be picked as long as an English Female talker is
|
|
|
|
* available.
|
|
|
|
*
|
|
|
|
* The algorithm used by %KTTSD to find a matching talker is as follows:
|
|
|
|
*
|
|
|
|
* - If language code is not specified by the application, assume default configured
|
|
|
|
* by user. The primary language code automatically has priority.
|
|
|
|
* - (Note: This is not yet implemented.)
|
|
|
|
* If there are no talkers configured in the language, %KTTSD will attempt
|
|
|
|
* to automatically configure one (see automatic configuraton discussion below)
|
|
|
|
* - The talker that matches on the most priority attributes wins.
|
|
|
|
* - If a tie, the one that matches on the most preferred attributes wins.
|
|
|
|
* - If there is still a tie, the one nearest the top of the kttsmgr display
|
|
|
|
* (first configured) will be chosen.
|
|
|
|
*
|
|
|
|
* Language codes actually consist of two parts, a language code and an optional
|
|
|
|
* country code. For example, en_GB is English (United Kingdom). The language code is
|
|
|
|
* treated as a priority attribute, but the country code (if specified) is treated
|
|
|
|
* as preferred. So for example, if an application requests the following
|
|
|
|
* talker code
|
|
|
|
*
|
|
|
|
* lang="en_GB" gender="male" volume="medium"
|
|
|
|
*
|
|
|
|
* then a talker configured as lang="en" gender="male" volume="medium" would be
|
|
|
|
* picked over one configured as lang="en_GB" gender="female" volume="soft",
|
|
|
|
* since the former matches on two preferred attributes and the latter only on the
|
|
|
|
* preferred attribute GB. An application can override this and make the country
|
|
|
|
* code priority with an asterisk. For example,
|
|
|
|
*
|
|
|
|
* lang="*en_GB" gender="male" volume="medium"
|
|
|
|
*
|
|
|
|
* To specify that American English is priority, put an asterisk in front of
|
|
|
|
* en_US, like this.
|
|
|
|
*
|
|
|
|
* lang="*en_US" gender="male" volume="medium"
|
|
|
|
*
|
|
|
|
* Here the application is indicating that a talker that speaks American English
|
|
|
|
* has priorty over one that speaks a different form of English.
|
|
|
|
*
|
|
|
|
* (Note: Not yet implemented).
|
|
|
|
* If a language code is specified, and no plugin is currently configured
|
|
|
|
* with a matching language code, %KTTSD will attempt to automatically
|
|
|
|
* load and configure a plugin to support the requested language. If
|
|
|
|
* there is no such plugin, or there is a plugin but it cannot automatically
|
|
|
|
* configure itself, %KTTSD will pick one of the configured plugins using the
|
|
|
|
* algorithm given above.
|
|
|
|
*
|
|
|
|
* Notice that %KTTSD will always pick a talker, even if it is a terrible match.
|
|
|
|
* (The principle is that something heard is better than nothing at all. If
|
|
|
|
* it sounds terrible, user will change his configuration.)
|
|
|
|
* If an attribute is absolutely mandatory -- in other words the application
|
|
|
|
* must speak with the attribute or not at all -- the application can determine if
|
|
|
|
* there are any talkers configured with the attribute by calling getTalkers,
|
|
|
|
* and if there are none, display an error message to the user.
|
|
|
|
*
|
|
|
|
* Applications can implement their own talker-matching algorithm by
|
|
|
|
* calling getTalkers, then finding the desired talker from the returned
|
|
|
|
* list. When the full talker code is passed in, %KKTSD will find an exact
|
|
|
|
* match and use the specified talker.
|
|
|
|
*
|
|
|
|
* If an application requires a configuration that user has not created,
|
|
|
|
* it should display a message to user instructing them to run kttsmgr and
|
|
|
|
* configure the desired talker. (This must be done interactively because
|
|
|
|
* plugins often need user assistance locating voice files, etc.)
|
|
|
|
*
|
|
|
|
* The above scheme is designed to balance the needs
|
|
|
|
* of applications against user preferences. Applications are given the control
|
|
|
|
* they @e might need, without unnecessarily burdening the application author.
|
|
|
|
* If you are an application author, the above discussion might seem overly
|
|
|
|
* complicated. It isn't really all that complicated. Here are rules of thumb:
|
|
|
|
*
|
|
|
|
* - It is legitimate to give a NULL (0) talker code, in which case, the user's default
|
|
|
|
* talker will be used.
|
|
|
|
* - If you know the language code, give that in the talker code, otherwise
|
|
|
|
* leave it out.
|
|
|
|
* - If there is an attribute your application @e requires for proper functioning,
|
|
|
|
* specify that with an asterisk in front of it. For example, your app might
|
|
|
|
* speak in two different voices, Male and Female. (Since your
|
|
|
|
* app requires both genders, call getTalkers to determine if both genders
|
|
|
|
* are available, and if not, advise user to configure them. Better yet,
|
|
|
|
* give the user a choice of available distinquishing attributes
|
|
|
|
* (loud/soft, fast/slow, etc.)
|
|
|
|
* - If there are other attributes you would prefer, specify those without an
|
|
|
|
* asterisk, but leave them out if it doesn't really make any difference
|
|
|
|
* to proper functioning of your application. Let the user decide them
|
|
|
|
* when they configure %KTTS.
|
|
|
|
*
|
|
|
|
* One final note about talkers. %KTTSD does talker matching for each sentence
|
|
|
|
* spoken, just before the sentence is sent to a plugin for synthesis. Therefore,
|
|
|
|
* the user can change the effective talker in mid processing of a text job by
|
|
|
|
* changing his preferences, or even deleting or adding new talkers to the configuration.
|
|
|
|
*
|
|
|
|
* @section markup Speech Markup
|
|
|
|
*
|
|
|
|
* Note: %Speech Markup is not yet fully implemented in %KTTSD.
|
|
|
|
*
|
|
|
|
* Each of the five methods for queueing text to be spoken -- sayScreenReaderOutput,
|
|
|
|
* setText, appendText, sayMessage, and sayWarning -- may contain speech markup,
|
|
|
|
* provided that the plugin the user has configured supports that markup. The markup
|
|
|
|
* languages and plugins currently supported are:
|
|
|
|
*
|
|
|
|
* - %Speech Synthesis Markup language (SSML): Festival and Hadifix.
|
|
|
|
*
|
|
|
|
* This may change in the future as synthesizers improve.
|
|
|
|
*
|
|
|
|
* Before including markup in the text sent to kttsd, the application should
|
|
|
|
* query whether the currently-configured plugin
|
|
|
|
* supports the markup language by calling supportsMarkup.
|
|
|
|
*
|
|
|
|
* It it does not support the markup, it will be stripped out of the text.
|
|
|
|
*
|
|
|
|
* @section markers Support for Markers
|
|
|
|
*
|
|
|
|
* Note: Markers are not yet implemented in %KTTSD.
|
|
|
|
*
|
|
|
|
* When using a speech markup language, such as Sable, JSML, or SSML, the application may embed
|
|
|
|
* named markers into the text. If the user's chosen speech plugin supports markers, %KTTSD
|
|
|
|
* will emit DCOP signal markerSeen when the speech engine encounters the marker.
|
|
|
|
* Depending upon the speech engine and plugin, this may occur either when the speech engine
|
|
|
|
* encounters the marker during synthesis from text to speech, or when the speech is actually
|
|
|
|
* spoken on the audio device. The calling application can call the supportsMarkers
|
|
|
|
* method to determine if the currently configured plugin supports markers or not.
|
|
|
|
*
|
|
|
|
* @section sentenceparsing Sentence Parsing
|
|
|
|
*
|
|
|
|
* Not all speech engines provide robust capabilities for stopping synthesis that is in progress.
|
|
|
|
* To compensate for this, %KTTSD parses text jobs given to it by the setText and
|
|
|
|
* appendText methods into sentences and sends the sentences to the speech
|
|
|
|
* plugin one at a time. In this way, should the user wish to stop the speech
|
|
|
|
* output, they can do so, and the worst that will happen is that the last sentence
|
|
|
|
* will be completed. This is called Sentence Boundary Detection (SBD).
|
|
|
|
*
|
|
|
|
* Sentence Boundary Detection also permits the user to rewind by sentences.
|
|
|
|
*
|
|
|
|
* The default sentence delimiter used for plain text is as follows:
|
|
|
|
*
|
|
|
|
* - A period (.), question mark (?), exclamation mark (!), colon (:), or
|
|
|
|
* semi-colon (;) followed by whitespace (including newline), or
|
|
|
|
* - Two newlines in a row separated by optional whitespace, or
|
|
|
|
* - The end of the text.
|
|
|
|
*
|
|
|
|
* When given text containing speech markup, %KTTSD automatically determines the markup type
|
|
|
|
* and parses based on the sentence semantics of the markup language.
|
|
|
|
*
|
|
|
|
* An application may change the sentence delimiter by calling setSentenceDelimiter
|
|
|
|
* prior to calling setText. Changing the delimiter does not affect other
|
|
|
|
* applications.
|
|
|
|
*
|
|
|
|
* Text given to %KTTSD via the sayWarning, sayMessage, and sayScreenReaderOutput
|
|
|
|
* methods is @e not parsed into sentences. For this reason, applications
|
|
|
|
* should @e not send long messages with these methods.
|
|
|
|
*
|
|
|
|
* Sentence Boundary Detection is implemented as a plugin SBD filter. See
|
|
|
|
* filters for more information.
|
|
|
|
*
|
|
|
|
* @section filters Filters
|
|
|
|
*
|
|
|
|
* Users may specify filters in the kttsmgr GUI. Filters are plugins that modify the text
|
|
|
|
* to be spoken or change other characteristics of jobs. Currently, the following filter plugins
|
|
|
|
* are available:
|
|
|
|
*
|
|
|
|
* - String Replacer. Permits users to substitute for mispoken words, or vocalize chat
|
|
|
|
* emoticons.
|
|
|
|
* - XML Transformer. Given a particular XML or XHTML format, permits conversion of the
|
|
|
|
* XML to SSML (Speech Synthesis Markup Language) using XSLT (XML Style Language - Transforms)
|
|
|
|
* stylesheets.
|
|
|
|
* - Talker Chooser. Permits users to redirect jobs from one configured Talker to another
|
|
|
|
* based on the contents of the job or application that sent it.
|
|
|
|
*
|
|
|
|
* Additional plugins may be available in the future.
|
|
|
|
*
|
|
|
|
* In additional to these regular filters, KTTS also implements Sentence Boundary Detection (SBD)
|
|
|
|
* as a plugin filter. See sentenceparsing for more information.
|
|
|
|
*
|
|
|
|
* Regular filters are applied to Warnings, Messages, and Text jobs. SBD filters are
|
|
|
|
* only applied to regular Text jobs; they are not applied to Warnings and Messages. Screen
|
|
|
|
* Reader Outputs are never filtered.
|
|
|
|
*
|
|
|
|
* @section authors Authors
|
|
|
|
*
|
|
|
|
* @author José Pablo Ezequiel "Pupeno" Fernández <pupeno@kde.org>
|
|
|
|
* @author Gary Cramblitt <garycramblitt@comcast.net>
|
|
|
|
* @author Olaf Schmidt <ojschmidt@kde.org>
|
|
|
|
* @author Gunnar Schmi Dt <gunnar@schmi-dt.de>
|
|
|
|
*/
|
|
|
|
|
|
|
|
// NOTE: kspeech class is now obsolete. Please use KSpeech instead.
|
|
|
|
|
|
|
|
class KSpeech : virtual public DCOPObject {
|
|
|
|
K_DCOP
|
|
|
|
|
|
|
|
public:
|
|
|
|
/**
|
|
|
|
* @enum kttsdJobState
|
|
|
|
* Job states returned by method getTextJobState.
|
|
|
|
*/
|
|
|
|
enum kttsdJobState
|
|
|
|
{
|
|
|
|
jsQueued = 0, /**< Job has been queued but is not yet speakable. */
|
|
|
|
jsSpeakable = 1, /**< Job is speakable, but is not speaking. */
|
|
|
|
jsSpeaking = 2, /**< Job is currently speaking. */
|
|
|
|
jsPaused = 3, /**< Job has been paused. */
|
|
|
|
jsFinished = 4 /**< Job is finished and is deleteable. */
|
|
|
|
};
|
|
|
|
|
|
|
|
/**
|
|
|
|
* @enum kttsdMarkupType
|
|
|
|
* %Speech markup language types.
|
|
|
|
*/
|
|
|
|
enum kttsdMarkupType
|
|
|
|
{
|
|
|
|
mtPlain = 0, /**< Plain text */
|
|
|
|
mtJsml = 1, /**< Java %Speech Markup Language */
|
|
|
|
mtSsml = 2, /**< %Speech Synthesis Markup Language */
|
|
|
|
mtSable = 3, /**< Sable 2.0 */
|
|
|
|
mtHtml = 4 /**< HTML @since 3.5 */
|
|
|
|
};
|
|
|
|
|
|
|
|
k_dcop:
|
|
|
|
/** @name DCOP Methods */
|
|
|
|
//@{
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Determine whether the currently-configured speech plugin supports a speech markup language.
|
|
|
|
* @param talker Code for the talker to do the speaking. Example "en".
|
|
|
|
* If NULL, defaults to the user's default talker.
|
|
|
|
* @param markupType The kttsd code for the desired speech markup language.
|
|
|
|
* @return True if the plugin currently configured for the indicated
|
|
|
|
* talker supports the indicated speech markup language.
|
|
|
|
* @see kttsdMarkupType
|
|
|
|
*/
|
|
|
|
virtual bool supportsMarkup(const TQString &talker, uint markupType = 0) const = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Determine whether the currently-configured speech plugin supports markers in speech markup.
|
|
|
|
* @param talker Code for the talker to do the speaking. Example "en".
|
|
|
|
* If NULL, defaults to the user's default talker.
|
|
|
|
* @return True if the plugin currently configured for the indicated
|
|
|
|
* talker supports markers.
|
|
|
|
*/
|
|
|
|
virtual bool supportsMarkers(const TQString &talker) const = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Say a message as soon as possible, interrupting any other speech in progress.
|
|
|
|
* IMPORTANT: This method is reserved for use by Screen Readers and should not be used
|
|
|
|
* by any other applications.
|
|
|
|
* @param msg The message to be spoken.
|
|
|
|
* @param talker Code for the talker to do the speaking. Example "en".
|
|
|
|
* If NULL, defaults to the user's default talker.
|
|
|
|
* If no plugin has been configured for the specified Talker code,
|
|
|
|
* defaults to the closest matching talker.
|
|
|
|
*
|
|
|
|
* If an existing Screen Reader output is in progress, it is stopped and discarded and
|
|
|
|
* replaced with this new message.
|
|
|
|
*/
|
|
|
|
virtual ASYNC sayScreenReaderOutput(const TQString &msg, const TQString &talker) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Say a warning. The warning will be spoken when the current sentence
|
|
|
|
* stops speaking and takes precedence over Messages and regular text. Warnings should only
|
|
|
|
* be used for high-priority messages requiring immediate user attention, such as
|
|
|
|
* "WARNING. CPU is overheating."
|
|
|
|
* @param warning The warning to be spoken.
|
|
|
|
* @param talker Code for the talker to do the speaking. Example "en".
|
|
|
|
* If NULL, defaults to the user's default talker.
|
|
|
|
* If no plugin has been configured for the specified Talker code,
|
|
|
|
* defaults to the closest matching talker.
|
|
|
|
*/
|
|
|
|
virtual ASYNC sayWarning(const TQString &warning, const TQString &talker) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Say a message. The message will be spoken when the current sentence stops speaking
|
|
|
|
* but after any warnings have been spoken.
|
|
|
|
* Messages should be used for one-shot messages that can't wait for
|
|
|
|
* normal text messages to stop speaking, such as "You have mail.".
|
|
|
|
* @param message The message to be spoken.
|
|
|
|
* @param talker Code for the talker to do the speaking. Example "en".
|
|
|
|
* If NULL, defaults to the user's default talker.
|
|
|
|
* If no talker has been configured for the specified talker code,
|
|
|
|
* defaults to the closest matching talker.
|
|
|
|
*/
|
|
|
|
virtual ASYNC sayMessage(const TQString &message, const TQString &talker) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Sets the GREP pattern that will be used as the sentence delimiter.
|
|
|
|
* @param delimiter A valid GREP pattern.
|
|
|
|
*
|
|
|
|
* The default sentence delimiter is
|
|
|
|
@verbatim
|
|
|
|
([\\.\\?\\!\\:\\;])(\\s|$|(\\n *\\n))
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* Note that backward slashes must be escaped.
|
|
|
|
* When %KTTSD parses the text, it replaces all tabs, spaces, and formfeeds
|
|
|
|
* with a single space, and then replaces the sentence delimiters using
|
|
|
|
* the following statement:
|
|
|
|
@verbatim
|
|
|
|
TQString::replace(sentenceDelimiter, "\\1\t");
|
|
|
|
@endverbatim
|
|
|
|
*
|
|
|
|
* which replaces all sentence delimiters with a tab, but
|
|
|
|
* preserving the first capture text (first parenthesis). In other
|
|
|
|
* words, the sentence punctuation is preserved.
|
|
|
|
* The tab is later used to separate the text into sentences.
|
|
|
|
*
|
|
|
|
* Changing the sentence delimiter does not affect other applications.
|
|
|
|
*
|
|
|
|
* @see sentenceparsing
|
|
|
|
*/
|
|
|
|
virtual ASYNC setSentenceDelimiter(const TQString &delimiter) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Queue a text job. Does not start speaking the text.
|
|
|
|
* @param text The message to be spoken.
|
|
|
|
* @param talker Code for the talker to do the speaking. Example "en".
|
|
|
|
* If NULL, defaults to the user's default plugin.
|
|
|
|
* If no plugin has been configured for the specified Talker code,
|
|
|
|
* defaults to the closest matching talker.
|
|
|
|
* @return Job number.
|
|
|
|
*
|
|
|
|
* Plain text is parsed into individual sentences using the current sentence delimiter.
|
|
|
|
* Call setSentenceDelimiter to change the sentence delimiter prior to
|
|
|
|
* calling setText.
|
|
|
|
* Call getTextCount to retrieve the sentence count after calling setText.
|
|
|
|
*
|
|
|
|
* The text may contain speech mark language, such as Sable, JSML, or SSML,
|
|
|
|
* provided that the speech plugin/engine support it. In this case,
|
|
|
|
* sentence parsing follows the semantics of the markup language.
|
|
|
|
*
|
|
|
|
* Call startText to mark the job as speakable and if the
|
|
|
|
* job is the first speakable job in the queue, speaking will begin.
|
|
|
|
*
|
|
|
|
* @see getTextCount
|
|
|
|
* @see startText
|
|
|
|
*/
|
|
|
|
virtual uint setText(const TQString &text, const TQString &talker) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Say a plain text job. This is a convenience method that
|
|
|
|
* combines setText and startText into a single call.
|
|
|
|
* @param text The message to be spoken.
|
|
|
|
* @param talker Code for the talker to do the speaking. Example "en".
|
|
|
|
* If NULL, defaults to the user's default plugin.
|
|
|
|
* If no plugin has been configured for the specified Talker code,
|
|
|
|
* defaults to the closest matching talker.
|
|
|
|
* @return Job number.
|
|
|
|
*
|
|
|
|
* Plain text is parsed into individual sentences using the current sentence delimiter.
|
|
|
|
* Call setSentenceDelimiter to change the sentence delimiter prior to
|
|
|
|
* calling setText.
|
|
|
|
* Call getTextCount to retrieve the sentence count after calling setText.
|
|
|
|
*
|
|
|
|
* The text may contain speech mark language, such as Sable, JSML, or SSML,
|
|
|
|
* provided that the speech plugin/engine support it. In this case,
|
|
|
|
* sentence parsing follows the semantics of the markup language.
|
|
|
|
*
|
|
|
|
* The job is marked speakable.
|
|
|
|
* If there are other speakable jobs preceeding this one in the queue,
|
|
|
|
* those jobs continue speaking and when finished, this job will begin speaking.
|
|
|
|
* If there are no other speakable jobs preceeding this one, it begins speaking.
|
|
|
|
*
|
|
|
|
* @see getTextCount
|
|
|
|
*
|
|
|
|
* @since KDE 3.5
|
|
|
|
*/
|
|
|
|
virtual uint sayText(const TQString &text, const TQString &talker) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Adds another part to a text job. Does not start speaking the text.
|
|
|
|
* @param text The message to be spoken.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
* @return Part number for the added part. Parts are numbered starting at 1.
|
|
|
|
*
|
|
|
|
* The text is parsed into individual sentences. Call getTextCount to retrieve
|
|
|
|
* the sentence count. Call startText to mark the job as speakable and if the
|
|
|
|
* job is the first speakable job in the queue, speaking will begin.
|
|
|
|
*
|
|
|
|
* @see setText.
|
|
|
|
* @see startText.
|
|
|
|
*/
|
|
|
|
virtual int appendText(const TQString &text, uint jobNum=0) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Queue a text job from the contents of a file. Does not start speaking the text.
|
|
|
|
* @param filename Full path to the file to be spoken. May be a URL.
|
|
|
|
* @param talker Code for the talker to do the speaking. Example "en".
|
|
|
|
* If NULL, defaults to the user's default talker.
|
|
|
|
* If no plugin has been configured for the specified Talker code,
|
|
|
|
* defaults to the closest matching talker.
|
|
|
|
* @param encoding Name of the encoding to use when reading the file. If
|
|
|
|
* NULL or Empty, uses default stream encoding.
|
|
|
|
* @return Job number. 0 if an error occurs.
|
|
|
|
*
|
|
|
|
* Plain text is parsed into individual sentences using the current sentence delimiter.
|
|
|
|
* Call setSentenceDelimiter to change the sentence delimiter prior to calling setText.
|
|
|
|
* Call getTextCount to retrieve the sentence count after calling setText.
|
|
|
|
*
|
|
|
|
* The text may contain speech mark language, such as Sable, JSML, or SSML,
|
|
|
|
* provided that the speech plugin/engine support it. In this case,
|
|
|
|
* sentence parsing follows the semantics of the markup language.
|
|
|
|
*
|
|
|
|
* Call startText to mark the job as speakable and if the
|
|
|
|
* job is the first speakable job in the queue, speaking will begin.
|
|
|
|
*
|
|
|
|
* @see getTextCount
|
|
|
|
* @see startText
|
|
|
|
*/
|
|
|
|
virtual uint setFile(const TQString &filename, const TQString &talker,
|
|
|
|
const TQString& encoding) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Get the number of sentences in a text job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
* @return The number of sentences in the job. -1 if no such job.
|
|
|
|
*
|
|
|
|
* The sentences of a job are given sequence numbers from 1 to the number returned by this
|
|
|
|
* method. The sequence numbers are emitted in the sentenceStarted and
|
|
|
|
* sentenceFinished signals.
|
|
|
|
*/
|
|
|
|
virtual int getTextCount(uint jobNum=0) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Get the job number of the current text job.
|
|
|
|
* @return Job number of the current text job. 0 if no jobs.
|
|
|
|
*
|
|
|
|
* Note that the current job may not be speaking. See isSpeakingText.
|
|
|
|
*
|
|
|
|
* @see getTextJobState.
|
|
|
|
* @see isSpeakingText
|
|
|
|
*/
|
|
|
|
virtual uint getCurrentTextJob() = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Get the number of jobs in the text job queue.
|
|
|
|
* @return Number of text jobs in the queue. 0 if none.
|
|
|
|
*/
|
|
|
|
virtual uint getTextJobCount() = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Get a comma-separated list of text job numbers in the queue.
|
|
|
|
* @return Comma-separated list of text job numbers in the queue.
|
|
|
|
*/
|
|
|
|
virtual TQString getTextJobNumbers() = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Get the state of a text job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
* @return State of the job. -1 if invalid job number.
|
|
|
|
*
|
|
|
|
* @see kttsdJobState
|
|
|
|
*/
|
|
|
|
virtual int getTextJobState(uint jobNum=0) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Get information about a text job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
* @return A TQDataStream containing information about the job.
|
|
|
|
* Blank if no such job.
|
|
|
|
*
|
|
|
|
* The stream contains the following elements:
|
|
|
|
* - int state - Job state.
|
|
|
|
* - TQCString appId - DCOP senderId of the application that requested the speech job.
|
|
|
|
* - TQString talker - Talker Code requested by application.
|
|
|
|
* - int seq - Current sentence being spoken. Sentences are numbered starting at 1.
|
|
|
|
* - int sentenceCount - Total number of sentences in the job.
|
|
|
|
* - int partNum - Current part of the job begin spoken. Parts are numbered starting at 1.
|
|
|
|
* - int partCount - Total number of parts in the job.
|
|
|
|
*
|
|
|
|
* Note that sequence numbers apply to the entire job. They do not start from 1 at the beginning of
|
|
|
|
* each part.
|
|
|
|
*
|
|
|
|
* The following sample code will decode the stream:
|
|
|
|
@code
|
|
|
|
TQByteArray jobInfo = getTextJobInfo(jobNum);
|
|
|
|
TQDataStream stream(jobInfo, IO_ReadOnly);
|
|
|
|
int state;
|
|
|
|
TQCString appId;
|
|
|
|
TQString talker;
|
|
|
|
int seq;
|
|
|
|
int sentenceCount;
|
|
|
|
int partNum;
|
|
|
|
int partCount;
|
|
|
|
stream >> state;
|
|
|
|
stream >> appId;
|
|
|
|
stream >> talker;
|
|
|
|
stream >> seq;
|
|
|
|
stream >> sentenceCount;
|
|
|
|
stream >> partNum;
|
|
|
|
stream >> partCount;
|
|
|
|
@endcode
|
|
|
|
*/
|
|
|
|
virtual TQByteArray getTextJobInfo(uint jobNum=0) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Given a Talker Code, returns the Talker ID of the talker that would speak
|
|
|
|
* a text job with that Talker Code.
|
|
|
|
* @param talkerCode Talker Code.
|
|
|
|
* @return Talker ID of the talker that would speak the text job.
|
|
|
|
*/
|
|
|
|
virtual TQString talkerCodeToTalkerId(const TQString& talkerCode) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Return a sentence of a job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
* @param seq Sequence number of the sentence.
|
|
|
|
* @return The specified sentence in the specified job. If no such
|
|
|
|
* job or sentence, returns "".
|
|
|
|
*/
|
|
|
|
virtual TQString getTextJobSentence(uint jobNum=0, uint seq=0) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Determine if kttsd is currently speaking any text jobs.
|
|
|
|
* @return True if currently speaking any text jobs.
|
|
|
|
*/
|
|
|
|
virtual bool isSpeakingText() const = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Remove a text job from the queue.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
*
|
|
|
|
* The job is deleted from the queue and the textRemoved signal is emitted.
|
|
|
|
*
|
|
|
|
* If there is another job in the text queue, and it is marked speakable,
|
|
|
|
* that job begins speaking.
|
|
|
|
*/
|
|
|
|
virtual ASYNC removeText(uint jobNum=0) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Start a text job at the beginning.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
*
|
|
|
|
* Rewinds the job to the beginning.
|
|
|
|
*
|
|
|
|
* The job is marked speakable.
|
|
|
|
* If there are other speakable jobs preceeding this one in the queue,
|
|
|
|
* those jobs continue speaking and when finished, this job will begin speaking.
|
|
|
|
* If there are no other speakable jobs preceeding this one, it begins speaking.
|
|
|
|
*
|
|
|
|
* The textStarted signal is emitted when the text job begins speaking.
|
|
|
|
* When all the sentences of the job have been spoken, the job is marked for deletion from
|
|
|
|
* the text queue and the textFinished signal is emitted.
|
|
|
|
*/
|
|
|
|
virtual ASYNC startText(uint jobNum=0) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Stop a text job and rewind to the beginning.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
*
|
|
|
|
* The job is marked not speakable and will not be speakable until startText
|
|
|
|
* or resumeText is called.
|
|
|
|
*
|
|
|
|
* If there are speaking jobs preceeding this one in the queue, they continue speaking.
|
|
|
|
*
|
|
|
|
* If the job is currently speaking, the textStopped signal is emitted,
|
|
|
|
* the job stops speaking, and if the next job in the queue is speakable, it
|
|
|
|
* begins speaking.
|
|
|
|
*
|
|
|
|
* Depending upon the speech engine and plugin used, speech may not stop immediately
|
|
|
|
* (it might finish the current sentence).
|
|
|
|
*/
|
|
|
|
virtual ASYNC stopText(uint jobNum=0) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Pause a text job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
*
|
|
|
|
* The job is marked as paused and will not be speakable until resumeText or
|
|
|
|
* startText is called.
|
|
|
|
*
|
|
|
|
* If there are speaking jobs preceeding this one in the queue, they continue speaking.
|
|
|
|
*
|
|
|
|
* If the job is currently speaking, the textPaused signal is emitted and the job
|
|
|
|
* stops speaking. Note that if the next job in the queue is speakable, it does
|
|
|
|
* not start speaking as long as this job is paused.
|
|
|
|
*
|
|
|
|
* Depending upon the speech engine and plugin used, speech may not stop immediately
|
|
|
|
* (it might finish the current sentence).
|
|
|
|
*
|
|
|
|
* @see resumeText
|
|
|
|
*/
|
|
|
|
virtual ASYNC pauseText(uint jobNum=0) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Start or resume a text job where it was paused.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
*
|
|
|
|
* The job is marked speakable.
|
|
|
|
*
|
|
|
|
* If the job is currently speaking, or is waiting to be spoken (speakable
|
|
|
|
* state), the resumeText() call is ignored.
|
|
|
|
*
|
|
|
|
* If the job is currently queued, or is finished, it is the same as calling
|
|
|
|
* @see startText .
|
|
|
|
*
|
|
|
|
* If there are speaking jobs preceeding this one in the queue,
|
|
|
|
* those jobs continue speaking and when finished this job will begin
|
|
|
|
* speaking where it left off.
|
|
|
|
*
|
|
|
|
* The textResumed signal is emitted when the job resumes.
|
|
|
|
*
|
|
|
|
* @see pauseText
|
|
|
|
*/
|
|
|
|
virtual ASYNC resumeText(uint jobNum=0) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Get a list of the talkers configured in KTTS.
|
|
|
|
* @return A TQStringList of fully-specified talker codes, one
|
|
|
|
* for each talker user has configured.
|
|
|
|
*
|
|
|
|
* @see talkers
|
|
|
|
*/
|
|
|
|
virtual TQStringList getTalkers() = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Change the talker for a text job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
* @param talker New code for the talker to do the speaking. Example "en".
|
|
|
|
* If NULL, defaults to the user's default talker.
|
|
|
|
* If no plugin has been configured for the specified Talker code,
|
|
|
|
* defaults to the closest matching talker.
|
|
|
|
*/
|
|
|
|
virtual ASYNC changeTextTalker(const TQString &talker, uint jobNum=0 ) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Get the user's default talker.
|
|
|
|
* @return A fully-specified talker code.
|
|
|
|
*
|
|
|
|
* @see talkers
|
|
|
|
* @see getTalkers
|
|
|
|
*/
|
|
|
|
virtual TQString userDefaultTalker() = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Move a text job down in the queue so that it is spoken later.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
*
|
|
|
|
* If the job is currently speaking, it is paused.
|
|
|
|
* If the next job in the queue is speakable, it begins speaking.
|
|
|
|
*/
|
|
|
|
virtual ASYNC moveTextLater(uint jobNum=0) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Jump to the first sentence of a specified part of a text job.
|
|
|
|
* @param partNum Part number of the part to jump to. Parts are numbered starting at 1.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
* @return Part number of the part actually jumped to.
|
|
|
|
*
|
|
|
|
* If partNum is greater than the number of parts in the job, jumps to last part.
|
|
|
|
* If partNum is 0, does nothing and returns the current part number.
|
|
|
|
* If no such job, does nothing and returns 0.
|
|
|
|
* Does not affect the current speaking/not-speaking state of the job.
|
|
|
|
*/
|
|
|
|
virtual int jumpToTextPart(int partNum, uint jobNum=0) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Advance or rewind N sentences in a text job.
|
|
|
|
* @param n Number of sentences to advance (positive) or rewind (negative) in the job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* If zero, applies to the last job queued by the application,
|
|
|
|
* but if no such job, applies to the current job (if any).
|
|
|
|
* @return Sequence number of the sentence actually moved to. Sequence numbers
|
|
|
|
* are numbered starting at 1.
|
|
|
|
*
|
|
|
|
* If no such job, does nothing and returns 0.
|
|
|
|
* If n is zero, returns the current sequence number of the job.
|
|
|
|
* Does not affect the current speaking/not-speaking state of the job.
|
|
|
|
*/
|
|
|
|
virtual uint moveRelTextSentence(int n, uint jobNum=0) = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Add the clipboard contents to the text queue and begin speaking it.
|
|
|
|
*/
|
|
|
|
virtual ASYNC speakClipboard() = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Displays the %KTTS Manager dialog. In this dialog, the user may backup or skip forward in
|
|
|
|
* any text job by sentence or part, rewind jobs, pause or resume jobs, or
|
|
|
|
* delete jobs.
|
|
|
|
*/
|
|
|
|
virtual void showDialog() = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Stop the service.
|
|
|
|
*/
|
|
|
|
virtual void kttsdExit() = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Re-start %KTTSD.
|
|
|
|
*/
|
|
|
|
virtual void reinit() = 0;
|
|
|
|
|
|
|
|
/**
|
|
|
|
* Return the KTTSD deamon version number.
|
|
|
|
* @since KDE 3.5
|
|
|
|
*/
|
|
|
|
virtual TQString version() = 0;
|
|
|
|
//@}
|
|
|
|
|
|
|
|
k_dcop_signals:
|
|
|
|
void ignoreThis();
|
|
|
|
|
|
|
|
/** @name DCOP Signals */
|
|
|
|
//@{
|
|
|
|
|
|
|
|
/**
|
|
|
|
* This signal is emitted when KTTSD starts or restarts after a call to reinit.
|
|
|
|
*/
|
|
|
|
void kttsdStarted();
|
|
|
|
/**
|
|
|
|
* This signal is emitted just before KTTSD exits.
|
|
|
|
*/
|
|
|
|
void kttsdExiting();
|
|
|
|
/**
|
|
|
|
* This signal is emitted when the speech engine/plugin encounters a marker in the text.
|
|
|
|
* @param appId DCOP application ID of the application that queued the text.
|
|
|
|
* @param markerName The name of the marker seen.
|
|
|
|
*
|
|
|
|
* @see markers
|
|
|
|
*/
|
|
|
|
void markerSeen(const TQCString& appId, const TQString& markerName);
|
|
|
|
/**
|
|
|
|
* This signal is emitted whenever a sentence begins speaking.
|
|
|
|
* @param appId DCOP application ID of the application that queued the text.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* @param seq Sequence number of the text.
|
|
|
|
*
|
|
|
|
* @see getTextCount
|
|
|
|
*/
|
|
|
|
void sentenceStarted(const TQCString& appId, uint jobNum, uint seq);
|
|
|
|
/**
|
|
|
|
* This signal is emitted when a sentence has finished speaking.
|
|
|
|
* @param appId DCOP application ID of the application that queued the text.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* @param seq Sequence number of the text.
|
|
|
|
*
|
|
|
|
* @see getTextCount
|
|
|
|
*/
|
|
|
|
void sentenceFinished(const TQCString& appId, uint jobNum, uint seq);
|
|
|
|
|
|
|
|
/**
|
|
|
|
* This signal is emitted whenever a new text job is added to the queue.
|
|
|
|
* @param appId The DCOP senderId of the application that created the job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
*/
|
|
|
|
void textSet(const TQCString& appId, uint jobNum);
|
|
|
|
|
|
|
|
/**
|
|
|
|
* This signal is emitted whenever a new part is appended to a text job.
|
|
|
|
* @param appId The DCOP senderId of the application that created the job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
* @param partNum Part number of the new part. Parts are numbered starting
|
|
|
|
* at 1.
|
|
|
|
*/
|
|
|
|
void textAppended(const TQCString& appId, uint jobNum, int partNum);
|
|
|
|
|
|
|
|
/**
|
|
|
|
* This signal is emitted whenever speaking of a text job begins.
|
|
|
|
* @param appId The DCOP senderId of the application that created the job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
*/
|
|
|
|
void textStarted(const TQCString& appId, uint jobNum);
|
|
|
|
/**
|
|
|
|
* This signal is emitted whenever a text job is finished. The job has
|
|
|
|
* been marked for deletion from the queue and will be deleted when another
|
|
|
|
* job reaches the Finished state. (Only one job in the text queue may be
|
|
|
|
* in state Finished at one time.) If startText or resumeText is
|
|
|
|
* called before the job is deleted, it will remain in the queue for speaking.
|
|
|
|
* @param appId The DCOP senderId of the application that created the job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
*/
|
|
|
|
void textFinished(const TQCString& appId, uint jobNum);
|
|
|
|
/**
|
|
|
|
* This signal is emitted whenever a speaking text job stops speaking.
|
|
|
|
* @param appId The DCOP senderId of the application that created the job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
*
|
|
|
|
* The signal is only emitted if stopText() is called and the job is currently
|
|
|
|
* speaking.
|
|
|
|
*/
|
|
|
|
void textStopped(const TQCString& appId, uint jobNum);
|
|
|
|
/**
|
|
|
|
* This signal is emitted whenever a speaking text job is paused.
|
|
|
|
* @param appId The DCOP senderId of the application that created the job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
*/
|
|
|
|
void textPaused(const TQCString& appId, uint jobNum);
|
|
|
|
/**
|
|
|
|
* This signal is emitted when a text job, that was previously paused, resumes speaking.
|
|
|
|
* @param appId The DCOP senderId of the application that created the job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
*/
|
|
|
|
void textResumed(const TQCString& appId, uint jobNum);
|
|
|
|
/**
|
|
|
|
* This signal is emitted whenever a text job is deleted from the queue.
|
|
|
|
* The job is no longer in the queue when this signal is emitted.
|
|
|
|
* @param appId The DCOP senderId of the application that created the job.
|
|
|
|
* @param jobNum Job number of the text job.
|
|
|
|
*/
|
|
|
|
void textRemoved(const TQCString& appId, uint jobNum);
|
|
|
|
//@}
|
|
|
|
};
|
|
|
|
|
|
|
|
#endif // _KSPEECH_H_
|