




已阅读5页,还剩37页未读, 继续免费阅读
版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
关于数学公式语音 项目的调研,报告人:彭云辉 2010-4-8,纲要,国外关于数学语音的一些相关的研究 这些项目表示数学公式所采用的语言/格式 相关项目的大体思路及意图 如何消除歧义 项目的优点和局限性及对项目的改进意见,主要相关项目及对应机构或人物,另外,还有一些机构、团体对数学公式语音作了深入的研究,并取得了一定的成果。如University of California at Berkeley(How can we speak math/Math Speak & Write, a Computer Program to Read and Hear Mathematical Input) 和 School of Computing, Dublin City University (Mathematics: How and What to Speak)等。,RETURN,数学表达式源语言,RETURN,an excellent prototype for speaking mathematics LaTeX to audio documents. Can speak both literary texts and highly technical documents that contain complex mathematics. the adequacy of the audio rendering depends on how well the electronic document captures the essential internal structure of the information. produced a structured representation and an audio formatting language (AFL) to provide an interactive environment for listening to and browsing technical documents. uses the Emacs(文本编辑器) front-end (Linux).,有关AsTeR( Audio System for Technical Readings ) (Raman 1994),AsTeR(续),creates an internal representation easier,used to help the audio rendering,Mathtalk的大体思路及意图 1. a set of rules to insert prosodic cues into spoken algebraic expressions. 2. analyzed the way mathematics teachers speak mathematical expressions and integrated these natural voice inflections(音调变化). 3. only insertion of prosodic cues (pitch, amplitude, and pauses) into computer-spoken mathematical expressions ,none insertion of lexical cues.,creates the necessary HTML/XML tags for visually-impaired and blind users to use their current screen-reading tools (e.g. JAWS and Window-Eyes for Windows) to read HTML and MathML/XML pages that contain math expressions, to read them in a spoken language.,This markup is not displayed in the browser. Only the MathML visual markup, or a PNG image, or a LiveMath Plug-In interactive image - whatever the author intended, is shown. The “MathSpeak This“ function makes it possible to hear the expression read during the creation/editing process,大体思路: 1. deriving Braille and Spoken Output from LaTeX Documents 2. render spoken mathematics from MathML using prosodic features such as pauses and speaking rate . 3. the use of prosody in synthesized speech to indicate nesting structure. 主要意图: To take a subset of LaTeX and produce both Braille and Spoken out from it. To accurately model a document and to present this to the blind user using a simple and intuitive interface. To harness the capabilities of synthetic speech devices to give more meaningful spoken output to the user.,TechRead的大体思路及意图,AudioMath的大体思路及意图 made use of its own database of prosodic rules in the generation of the spoken expressions. Available in 4 different ways: - ActiveX DLL - .NET component - CGI interface - Executable EXE,Auto-Discovery (the “brain” of the operation that recognizes or identifies elements in the document and calls the respective conversion modules ) Numerals (conversion of several types of numeric forms) Abbreviations Acronyms Network References Mathematical (MathML expressions ),6 modules for the conversion part,AudioMath设计流程图,MathPlayer,a plug-in to Microsoft Internet Explorer (IE) and Adobe Acrobat/Reader that renders MathML visually. is able to dynamically display a mathematical expression according to its font and the color set, users can choose the most suitable font or color scheme for their reading needs. For example, visually impaired readers are likely to set a large font and high color contrast.,上述公式在MathPlayer中的读法为: cap U bar sup h equals one minus exponent open minus fraction 8 cap T sup h over end fraction close,读法: equals ln open fraction n over s end close plus open fraction k sup h over k prime sup h end fraction close ln open s close minus zero point seven five plus open two l z minus z squared close fraction k sup h over q sup w end fraction.,MathSpeak Project的大体思路及意图 The project is one of the proposed methods, consisting of a group of rules to dictate mathematical contents. However it is not a standard and it is intended to serve blind people that want to transcribe their documents into Nemeth Code 18, and later on into Braille.,RETURN,如何消除歧义,两大策略:,Use of lexical indicators (a) x plus begin fraction one over x end fraction minus one (b) begin fraction xplusone over x end fraction minus one Use of prosodic indicators(pauses, modifications of pitch and tempo, rhythm and tone) (a) x plus one over x minus one (b) “ xplus one over x minus one,在消除歧义这一部分,MathPlayer没什么优势,而AudioMath在这一方面做得很不错,其余的一些相关软件也没什么出众的地方,下面着重谈谈AudioMath,AudioMath(葡萄牙语),Lexical Square root of power base a exponent two, end of power, plus power base b exponent two, end of power, end of radicand Prosodic Square root of (LP) a squared (SP) plus b squared (LP) end of radicand,AudioMath tone rules:,1- Rising tone: used when a lower hierarchical level is starting. (root of) 2- Falling and Rising tone: used to mark the smaller separating pause. (a squared) 3- Falling tone: used when level is ended. (b squared) 4- Emphatic Falling tone: used at the end of the expression that simultaneously is the higher hierarchical level (end of radicand).,LP,SP,LP,RETURN,AudioMath优点:,supports usermode options. An example : 1.25 one point twenty five OR one point two five,Future Work of AudioMath - Complete the support for MathML Content Markup - Study in more detail mathematical prosody - Implement a proper blind tool - Add more languages - Enhancements on XHTML support - Implement SAPI, SSML support for TTS technologies,MathPlayer局限性,the use of tables and the representation of matrices and the possibility of some ambiguous readings no math formulae navigation support. gets complicated with complex math expressions no provision for any kind of user adapted preferences scheme(usermode) has ambiguous rendering in some mathematical expressions. Does not use prosody to render mathematical expressions by speech output. It generates text strings made up of the names of mathematical symbols and commas and periods to set pauses.,MathPlayer优势,allows web browsers users to copy a MathML expression and paste it in a MathML-aware program. This is particularly useful for computation, but might also be useful when used in conjunction with other software aimed at making math accessible (e.g. the LAMBDA system) or with mainstream applications used to process scientific documents (e.g. MathType or Scientific Notebook).,Changes in MathPlayer 2.2,MathPlayer 2.2 (released February 2010) is an upgrade and includes the following: Significantly improved font handling and rendering: Improved support for STIX, Cambria, and other Unicode fonts. Improvements for anti-aliased rendering. Better protection against fonts that contain errors in their tables. More characters are displayed. Improved performance when Internet Explorers zoom is not 100%. Improved compatibility with ASCIIMathML. Fixed bugs with content MathML (handling of , “Copy MathML“),Future work of MathPlayer,MathPlayers speech rules are based upon a pattern matcher/rule system. The rules are able to specify synchronization points and prosody in addition to text to speak. The rules provide a great deal of flexibility and allow users to match structures such as limits and integrals so that they are spoken in the customary manner rather than treating them as general expressions with limits and/or scripts.,Future work of MathPlayer (续),The downside to this power is that
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 应急安全教育培训感想课件
- 2023年度重庆资源与环境保护职业学院单招《物理》全真模拟模拟题附参考答案详解【完整版】
- 2024施工员题库附答案详解(夺分金卷)
- 计算机四级真题(能力提升)附答案详解
- 2025年咨询工程师高分题库【原创题】附答案详解
- 私人之间供货合同(标准版)
- 授权公司合同(标准版)
- 农业土地租赁合同(标准版)
- 订购门窗合同(标准版)
- 2025年中级软考综合提升测试卷完整附答案详解
- 退役军人优抚政策课件
- 财务遴选笔试题及答案
- (2025秋新版)人教版二年级数学上册全册教案(教学设计)
- 六年级上册音乐课教案
- 肿瘤病人疼痛评估与干预策略
- 物业管理人员考核制度及标准
- 计算机视觉技术课件
- 大学书法教学课件
- 河北省科技工程学校招聘真题2024
- 茶叶出口培训课件
- 家电行业售后服务组织架构及人员岗位职责
评论
0/150
提交评论