00项目预研语音_第1页
00项目预研语音_第2页
00项目预研语音_第3页
00项目预研语音_第4页
00项目预研语音_第5页
已阅读5页,还剩4页未读 继续免费阅读

付费下载

下载本文档

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

1、项目预研Engine 项目组华技大学软件学院2005信 息父项名称父项标识版本子项文档名称子项文档标识版本修 改 信 息修 改 者日 期旧版本修改标识原 因审 核日 期新版本批 准日 期配 置 信 息项目名称移动项目标识Engine-BM-2005-01版 本1.0文档名称项目预研文档标识PDS-2005-01版 本1.0编 辑撰写人时 间2005-5-15版 本1.0审 核 批 准审 核日 期2005-5-17版 本1.0批 准日 期2005-5-17项目预研1 引言1.1 编写目的项目预研目的旨在梳理语音开发流程,确定应用开发框架。1.2 背景项目名称:移动(b-mobile 让随身移动)

2、项目委托项目开发系统开发:华:华技大学软件学院技大学软件学院Engine 项目组:VS.NET、Speech Application SDK VerBeta 1.11.3 定义Speech Application SDK: 微软语音应用开发包1.4 参考资料Speech Application SDK 开系统需求说明档2 应用程序组织模式Speech Application SDK 程序模式之2.1基础篇(SALT)The SAPI API provides a high-levelerface betspeech engines. SAPI implements all the low-le

3、veln an application ands needed tocontrol and manage the real-time operations of various speech engines.The two basic types of SAPI engines are text-to-speech (TTS) systems andspeechspoken spokenrecognizers. TTS systems synthesize text strings and filesoaudio using synthetic voi. Speech recognizers

4、convert humanaudioo readable text strings and files.Speech Application SDK 程序模式之2.2componentsoftheSpeechApplication PlatformRequired ComponentsDeploying a speech-enabled Web application using SALT markuprequires three components.1.An ASP.NET serverThe Web server generates Wges containing HTML, SALT,

5、and embedded script. The script controls the dialogue flow forvoice-onlyeractions. For example, if there are severalprompts on a page, the script defines the order in which the audio prompts play.2.A Speech ServerSpeech Server recognizes speech, and plays audio prompts and responses.3.A cntThe Speec

6、h Platform supports two types of cnts:ephonyApplication Servicnts, and multimodal cnts with averofernet ernetExplorer running either Speech Add-in for Explorer or Speech Add-in forPocketernet Explorer.The following diagram illustrates these elements and the types ofinformation they pros. It also ill

7、ustrates the relationship of theseelements to the Visual Studio .NET 2003 Speech Development Tools.3Common Usage ScenariosThis section illustrates three deployment configurations for commondeployment scenariost the Speech Platform supports.3.1ephony Scenariohis scenario,ephony Application Servi(TAS)

8、 is the cnt. Aephone acts as the terminal device, and connects to TAS through astandardephony board. Theephony board provides theerfacebetn theephone and TAS. At run time, TAS res on the Webserver for application logic, and on Speech Server for audio signalprosing.When the user dials a phone number

9、for aephony service, the callconnects to TAS. TAS assotes theephone call wivoice-onlySALTreter. Then TAS connects to the Web server and loads thedefault page for the applicationt provides the service for which thecaller is dialing. As the callereracts with the application, TASpasses audio and dual t

10、one multi-frequency (DTMF) input from thecaller to Speech Server, which performs speech recognition (SR),text-to-speech (TTS), and DTMF prosing.The SASDK includes a number of Dialog Speech Controlst supportComputer-Supportedmunications Applications (CSTA)servi. These include the AnswerCall, Transfer

11、Call, MakeCall,and DisconnectCall controls. Developers can use these controls toanswer, transfer, initiate, and disconnectephone calls, as well asgather call information, and send and receive CSTA events. TheSASDK also includes a SmexMessage (Simple Messaging Exten) controlt developers can use to se

12、nd and receive raw CSTA messages.3.2Desktop Multimodal Scenariohis scenario, the cnt isernet Explorer with SpeechAdd-in forernet Explorer installed. ASP.NETspeech-enabled Web application pages reside on the Web server.When the user enters a URL inernet Explorer, the Web serveropens the applications

13、default page. The Web server sends HTML,SALT, and JScript to the Speech Add-in on the desktop. SALT markuphe pagest the Web server sends to the cnt trigger speechrecognition and text-to-speech synthesis. In order to implement SALTfunctionality, at run time the Speech Add-in instantiates a sharedSAPI

14、 SR engine. If nesary, the Speech Add-in also instantiates aTTS and a prompt engine on the cnt. These engines on the desktopcnt perform all prompting, speech recognition, and text-to-speechsynthesis.Note Multimodal applications using a desktop c nt can be deployed using only the SASDK.3.3Windows Mob

15、iMultimodal Scenarioased Pocket PC 2003 (Pocket PC)his scenario, the cnt is Pocketernet Explorer with the SpeechAdd-in forPocketernet Explorer installed. ASP.NETspeech-enabled Web application pages reside on the Web server,along with the application grammars, and a configuration filecontaining the U

16、RL to the Speech Servert performs speechprosing.When the user enters a URL on Pocket PC, the Web server opens theapplications default .aspx page. The Web server also sends the URLpoing to Speech Server. The paget the Web server sendscontains HTML, SALT, and JScript. When the user taps aspeech-enable

17、d HTML element and talks, Pocket PC sends the audio to Speech Server. Along with the compressed audio, Pocket PC sendseither an inline recognition grammar, or a poer to the location of anexternally-d recognition grammart is bound totspeech-enabled element. If the recognition grammar is an inlinegram

18、mar, Speech Server loads the grammar and performs speechrecognition. If the grammar is an externally-d grammar, SpeechServerdownloads a copy of the grammar, loads the grammar,and then performs speech recognition.After the recognizer finishes, Speech Server sends SemMarkupLanguage (SML) output to the Pocket PC along wiudio for promptsif the application dialogue flow requires the appli

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

最新文档

评论

0/150

提交评论