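Project Pre-research
Engine Project Team
School of Software, 华技大学
2005

Document information (fields left blank in the source): parent item name, parent item ID, version; child document name, child document ID, version.

Revision information (fields left blank in the source): modified by, date, old version, change ID, reason; reviewed by, date, new version; approved by, date.

Configuration information
Project name: 移动 (Mobile)
Project ID: Engine-BM-2005-01, version 1.0
Document name: Project Pre-research
Document ID: PDS-2005-01, version 1.0
Edited by: author, 2005-5-15, version 1.0
Reviewed: 2005-5-17, version 1.0
Approved: 2005-5-17

Project Pre-research

1 Introduction

1.1 Purpose
This pre-research maps out the speech application development workflow and settles on an application development framework.

1.2 Background
Project name: 移动 (b-mobile 让随身移动, roughly "mobility that goes with you")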
Commissioned by: School of Software, 华技大学
Developed by: Engine Project Team, School of Software, 华技大学
System development: VS.NET, Speech Application SDK Ver Beta 1.1

1.3 Definitions
Speech Application SDK: Microsoft's speech application development kit

1.4 References
Speech Application SDK development documentation
System requirements specification

2 Application Organization Model

2.1 Speech Application SDK program model: fundamentals (SALT)
The SAPI API provides a high-level interface between an application and speech engines. SAPI implements all the low-level details needed to control and manage the real-time operations of various speech engines.

The two basic types of SAPI engines are text-to-speech (TTS) systems and speech recognizers. TTS systems synthesize text strings and files into spoken audio using synthetic voices. Speech recognizers convert human spoken audio into readable text strings and files.

2.2 Speech Application SDK program model: components of the Speech Application Platform

Required Components
Deploying a speech-enabled Web application using SALT markup requires three components.

1. An ASP.NET server
The Web server generates Web pages containing HTML, SALT, and embedded script. The script controls the dialogue flow for voice-only interactions. For example, if there are several prompts on a page, the script defines the order in which the audio prompts play.

2. A Speech Server
Speech Server recognizes speech, and plays audio prompts and responses.

3. A client
The Speech Platform supports two types of clients: Telephony Application Services clients, and multimodal clients with a version of Internet Explorer running either Speech Add-in for Internet Explorer or Speech Add-in for Pocket Internet Explorer.

The following diagram illustrates these elements and the types of information they process. It also illustrates the relationship of these elements to the Visual Studio .NET 2003 Speech Development Tools.

[Figure: Speech Application Platform components and the information they process]

3 Common Usage Scenarios
This section illustrates three deployment configurations for common deployment scenarios that the Speech Platform supports.

3.1 Telephony Scenario
In this scenario, Telephony Application Services (TAS) is the client. A telephone acts as the terminal device, and connects to TAS through a standard telephony board. The telephony board provides the interface between the telephone and TAS. At run time, TAS relies on the Web server for application logic, and on Speech Server for audio signal processing.

When the user dials a phone number for a telephony service, the call connects to TAS. TAS associates the telephone call with a voice-only SALT interpreter. Then TAS connects to the Web server and loads the default page for the application that provides the service for which the caller is dialing. As the caller interacts with the application, TAS passes audio and dual tone multi-frequency (DTMF) input from the caller to Speech Server, which performs speech recognition (SR), text-to-speech (TTS), and DTMF processing.

The SASDK includes a number of Dialog Speech Controls that support Computer-Supported Telecommunications Applications (CSTA) services. These include the AnswerCall, TransferCall, MakeCall, and DisconnectCall controls. Developers can use these controls to answer, transfer, initiate, and disconnect telephone calls, as well as gather call information, and send and receive CSTA events. The SASDK also includes a SmexMessage (Simple Messaging Extension) control that developers can use to send and receive raw CSTA messages.

3.2 Desktop Multimodal Scenario
In this scenario, the client is Internet Explorer with Speech Add-in for Internet Explorer installed. ASP.NET speech-enabled Web application pages reside on the Web server.

When the user enters a URL in Internet Explorer, the Web server opens the application's default page. The Web server sends HTML, SALT, and JScript to the Speech Add-in on the desktop. SALT markup in the pages that the Web server sends to the client triggers speech recognition and text-to-speech synthesis. In order to implement SALT functionality, at run time the Speech Add-in instantiates a shared SAPI SR engine. If necessary, the Speech Add-in also instantiates a TTS and a prompt engine on the client. These engines on the desktop client perform all prompting, speech recognition, and text-to-speech synthesis.

Note: Multimodal applications using a desktop client can be deployed using only the SASDK.

3.3 Windows Mobile-based Pocket PC 2003 (Pocket PC) Multimodal Scenario
In this scenario, the client is Pocket Internet Explorer with the Speech Add-in for Pocket Internet Explorer installed. ASP.NET speech-enabled Web application pages reside on the Web server, along with the application grammars, and a configuration file containing the URL to the Speech Server that performs speech processing.

When the user enters a URL on Pocket PC, the Web server opens the application's default .aspx page. The Web server also sends the URL pointing to Speech Server. The page that the Web server sends contains HTML, SALT, and JScript. When the user taps a speech-enabled HTML element and talks, Pocket PC sends the audio to Speech Server. Along with the compressed audio, Pocket PC sends either an inline recognition grammar, or a pointer to the location of an externally-stored recognition grammar that is bound to that speech-enabled element. If the recognition grammar is an inline grammar, Speech Server loads the grammar and performs speech recognition. If the grammar is an externally-stored grammar, Speech Server downloads a copy of the grammar, loads the grammar, and then performs speech recognition.

After the recognizer finishes, Speech Server sends Semantic Markup Language (SML) output to the Pocket PC along with audio for prompts if the application dialogue flow requires the appli…
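
The grammar handling described for the Pocket PC scenario (use an inline grammar directly, or download an externally stored grammar before recognizing) can be sketched roughly as below. This is an illustrative model only, not Speech Server code: `RecognitionRequest`, `resolve_grammar`, `handle_request`, and the `fetch` and `recognize` callables are hypothetical names introduced here, and the real server's interfaces are not shown in this document.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class RecognitionRequest:
    """What the Pocket PC client sends (hypothetical shape, for illustration)."""
    audio: bytes                          # compressed audio captured on the device
    inline_grammar: Optional[str] = None  # grammar text sent inline, if any
    grammar_url: Optional[str] = None     # pointer to an externally stored grammar


def resolve_grammar(request: RecognitionRequest,
                    fetch: Callable[[str], str]) -> str:
    """Return the grammar text that must be loaded before recognition.

    An inline grammar is used as-is. An external grammar is downloaded
    first through `fetch`, a stand-in for whatever retrieval the server uses.
    """
    if request.inline_grammar is not None:
        return request.inline_grammar
    if request.grammar_url is not None:
        return fetch(request.grammar_url)
    raise ValueError("request carries neither an inline grammar nor a grammar URL")


def handle_request(request: RecognitionRequest,
                   fetch: Callable[[str], str],
                   recognize: Callable[[bytes, str], str]) -> str:
    """Resolve the grammar, then run recognition and return its SML-like output."""
    grammar = resolve_grammar(request, fetch)
    return recognize(request.audio, grammar)
```

Passing `fetch` and `recognize` in as parameters keeps the sketch self-contained: the branch between inline and external grammars can be exercised with simple stand-in functions, without a network or a speech engine.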