Implementation of OAI-METS Metadata Generation for Dissertations

By analyzing the OAI XML format for theses and the TRS metadata format, and following the CALIS standards and specifications, this paper proposes a program design for generating OAI XML metadata, which can offer a solution for non-standard service systems.

As an important primary data resource with independent intellectual property rights, academic theses play an important role in teaching and scientific research. Compared with other database construction models, the joint construction of university library dissertation databases has advantages in cost, quality, and distribution, and in recent years it has become a trend in building characteristic databases under regional resource sharing.

Take Jiangsu Province as an example. The libraries of 19 universities, including Nanjing University of Aeronautics & Astronautics, participate in the JALIS Dissertations Database project. The project aims to build a dissertation service system with distinctive features, wide coverage, and broad benefit, in order to promote the sharing of teaching and research resources among universities in Jiangsu and to raise the level of teaching and research in the province's colleges and universities. The project has entered its third phase. With reference to the relevant CALIS standards and specifications, metadata standards and specifications for master's degree theses have been established. In the early stage of construction, the participating libraries' local dissertation databases, the JALIS thesis center harvesting platform, and the service portal were completed and put into service. Metadata for 25,729 dissertations have been harvested successfully from 11 participating libraries, with first-16-page PDF data available for 77.9% of them (follow-up harvesting is still in progress). Readers at universities and colleges in Jiangsu Province can search online free of charge, view thesis metadata and the first 16 pages of each thesis, and obtain the full text through online document delivery.

In actual operation, the project has run into several problems. The platforms built by the participating libraries are not uniform. In particular, some systems built early provide no OAI data export interface, supply metadata that does not conform to the specification, or cannot have their metadata harvested at all. The first 16 pages of full text cannot be harvested, the harvesting server cannot harvest from the libraries' metadata servers in real time without manual intervention, and data quality needs further improvement.

Take the library of Yangzhou University as an example. Its dissertation service system was supplied by Beijing TRS Information Technology Limited (TRS). The system handles thesis submission, review, cataloging, publishing, retrieval, and statistics online, and is divided into 4 modules: submission, management, librarian editing, and paper retrieval. It supports OAI but does not support manual data export. The system can provide local services and supply OAI metadata to the center platform, but because the first 16 pages of text are missing, the center platform's metadata rate is affected.

The project team compared the service systems used for dissertation submission. In recent years, dissertation service systems in China have mainly been TRS, TPI, IDL, and Founder, with the remainder being independently developed systems. The system chosen should be one recommended by CALIS, so that it can communicate and interoperate with the CALIS central portal. Among the 19 participating libraries, Yangzhou University and Hohai University selected the TRS system. Because these systems were built early and lack software upgrades and technical support, their data have not been harvested normally by the center platform. If the scope is widened to the CALIS thesis library led by the library of Tsinghua University, there are even more early users of TRS and similar systems, and data export problems have appeared there as well. As data providers, colleges and universities generally submit data through manual export or self-developed interfaces, which often leads to incomplete and non-standard data, or to a proliferation of interface programs.

An interface program was therefore developed for the local thesis service system. It packages the METS and OAI data into a single file, so that the metadata can be harvested smoothly by the center platform and the center platform's 16-page full-text rate is maintained. Following the CALIS thesis metadata description rules and the CALIS OAI and METS data export specifications, and targeting mainly the TRS system, the tool generates a standard OAI XML file for each thesis in the library of Yangzhou University. Each file carries a METS package containing the first 16 pages of the PDF full text, and the files are harvested successfully by the JALIS degree thesis central database.

Data analysis and related ideas

OAI-METS file specification

For OAI metadata to interoperate across a distributed network, each participating library must ensure that every thesis metadata record is exported as a separate file in XML format, and that the file meets the requirements of the export data format. From the XML file name format to each element, the element modifiers, the English tags of the modifiers, the encoding scheme, and whether each metadata item is mandatory or repeatable, everything must follow the descriptive metadata rules. Under the OAI 2.0 standard, OAI metadata and METS metadata can now be generated in a single file.

The record XML file can use one of two file encodings: UTF-8 and GBK. Since GBK does not support multiple languages, UTF-8 is usually used. The main part of the file is the Record section, which consists of three parts: Header (record header description), Metadata (OAI metadata description), and About (METS metadata description).

The OAI metadata is the second part of Record, Metadata. Written to the XML file according to the CALIS standards and specifications, it can be harvested by the central database, but it does not include the first 16 pages of text. The METS metadata is the About part of Record. METS mainly carries the first 16 pages of the thesis PDF, encoded in Base64. Following the CALIS METS and CALIS_OAI package structure specification and the data export specification, the thesis METS data is written into the same XML file, which yields the OAI-METS metadata.

XML file naming rules

XML file names should follow the standard naming format. First, a string is concatenated that includes the complete time in the zero time zone. The string is then converted: the character ":" becomes %3A and the character "/" becomes %2F. The converted string is the exported file name.

Base64 encoding rule

Base64 encodes data using 64 basic ASCII characters. The data to be encoded are split into groups of 3 bytes. The 24 bits of each group are divided, in order, into 4 groups of 6 bits, and two 0 bits are prepended at the high end of each group to make up a byte, so each 3-byte group is re-encoded as 4 bytes. When the number of bytes to be encoded is not an integer multiple of 3, that is, when the last group has fewer than 3 bytes, the last group is padded with 1 to 2 zero bytes and 1 to 2 "=" characters are appended after encoding is completed.

TRS data flow and characteristics

Before the TRS online submission module can be used, the online submission template is set up through the management module. This is where the dissertation metadata fields are configured, including the required fields. Dissertation metadata can then be entered and submitted online through the TRS submission module, including the upload step.

Analysis shows that the metadata records of dissertations submitted online through TRS are stored in three data tables of the online submission database, including trs_paper, a metadata table, and trspaperattach (the path table). All the fields needed to generate a thesis OAI XML file are contained in these three tables.

System development

Based on the above analysis, the program uses the three data tables provided by the TRS database, extracts the necessary fields, and strictly follows the CALIS rules on OAI XML file format, naming, and encoding to generate a CALIS-compliant OAI XML record file for each thesis.

Implementation steps

Querying metadata records. The program connects to the thesis database through ADO data objects and a SQL Server ODBC connection. The ADO connection string specifies the server IP, uid (user), pwd (password), and database=paper. SQL statements against the trspaper and trspaperattach tables then query the thesis metadata records.

Generating the OAI-METS metadata. From the queried thesis metadata records, and following the reference standard, the program Base64-encodes the first 16 pages of the PDF text and builds a complete OAI-METS metadata string. The less-than sign (<), greater-than sign (>), ampersand (&), and the single and double quotation marks are predefined entities in XML. If any of these 5 characters appears untreated in a thesis metadata field, it conflicts with the markup and causes an XML parsing error, so the 5 characters are converted to the corresponding entity references when the OAI-METS string is generated.

Writing the OAI XML file. Using traditional file I/O statements (OPEN, PRINT), the OAI-METS metadata string generated above is written to a file, which is the OAI XML file. The XML file name must comply with the naming format. In addition, files written this way default to ANSI encoding, so the file must be converted to UTF-8 afterwards, so that the encoding declared for the OAI XML file matches the actual encoding of the file.

Main functions of the system

The system supports retrieval by year, degree, subject, student number, and other criteria, querying the relevant tables of the TRS thesis database. It also checks whether the PDF file of the first 16 pages of each thesis exists. Only theses for which a METS package containing the 16 pages can be generated are exported. Otherwise the system skips the OAI-METS data of that thesis and records the unexported thesis in a log file, which provides a basis for later export. The Yangzhou University dissertation OAI XML data export interface is shown in Figure 1.

Figure 1 Yangzhou University degree thesis data export system interface

Files generated by the system are checked with the thesis metadata file quality inspection module V2.0 of the CALIS degree thesis and local characteristic database system. Only files that pass the check can be harvested by the central database.

Conclusion

By developing an OAI-METS metadata generation program for the degree thesis system, data can be harvested from local dissertation systems that do not provide standard OAI metadata. The program generates CALIS-standard metadata for each local dissertation, so that the central thesis database can harvest the local thesis metadata, which makes metadata exchange and sharing between the central and local thesis databases possible. For libraries running the same dissertation service system, the method provides a directly usable interface program. For libraries running different service systems, it offers an approach to supplying standard data for the construction of the central database.
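
The generation steps described in the article (escaping the 5 predefined XML characters, Base64-encoding the first 16 pages of the PDF, percent-encoding ":" and "/" in the file name, and writing the file as UTF-8) can be sketched in Python as follows. This is only an illustrative sketch, not the authors' program, which appears to rely on ADO and OPEN/PRINT file statements. The header/metadata/about layout follows the three Record parts named in the text, but the inner element names (`title`, `binData`), the identifier `oai:yzu/0001`, and the file name layout (identifier plus UTC timestamp) are assumptions; the real CALIS OAI and METS specifications define the actual schema and naming format.

```python
import base64
from datetime import datetime, timezone
from pathlib import Path
from xml.sax.saxutils import escape

# escape() already handles &, < and >; the two quote characters are added
# here so that all five predefined XML entities are covered.
QUOTES = {'"': "&quot;", "'": "&apos;"}


def xml_text(value: str) -> str:
    """Replace the five predefined XML characters with entity references."""
    return escape(value, QUOTES)


def utc_stamp(moment: datetime) -> str:
    """Format a timezone-aware datetime as a complete UTC timestamp."""
    return moment.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


def export_filename(identifier: str, moment: datetime) -> str:
    """Hypothetical name layout: identifier + UTC time, ':' and '/' encoded."""
    raw = f"{identifier}_{utc_stamp(moment)}"
    return raw.replace(":", "%3A").replace("/", "%2F") + ".xml"


def build_record(identifier: str, moment: datetime,
                 fields: dict[str, str], pdf_bytes: bytes) -> str:
    """Assemble one record: header, metadata (OAI) and about (METS payload)."""
    # Field names are assumed to be valid XML element names.
    meta = "".join(f"<{tag}>{xml_text(val)}</{tag}>" for tag, val in fields.items())
    pdf_b64 = base64.b64encode(pdf_bytes).decode("ascii")
    return (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        "<record>"
        f"<header><identifier>{xml_text(identifier)}</identifier>"
        f"<datestamp>{utc_stamp(moment)}</datestamp></header>"
        f"<metadata>{meta}</metadata>"
        # Placeholder wrapper for the Base64 text of the first 16 pages.
        f"<about><binData>{pdf_b64}</binData></about>"
        "</record>"
    )


def write_record(folder: Path, identifier: str, moment: datetime,
                 fields: dict[str, str], pdf_bytes: bytes) -> Path:
    """Write the record as UTF-8 so the declared and actual encodings agree."""
    target = folder / export_filename(identifier, moment)
    target.write_text(build_record(identifier, moment, fields, pdf_bytes),
                      encoding="utf-8")
    return target
```

Writing with `encoding="utf-8"` from the start avoids the separate ANSI-to-UTF-8 conversion pass that the article describes for the OPEN/PRINT approach.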