版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
1、元数据和数据模型元数据和数据模型杭诚方杭诚方 教授教授metadatametadatametadata is very important in the data warehouse environment. metadata is often described as data about data. metadata contains information on the location, the structure, and meaning of data, mapping information, and a guide to the algorithms used for summ
2、arization between detail and summary data. metadata metadata metadata contains detailed descriptions of the location, structure, and meaning of data; keys and indexes of the data; the algorithms and business rules used to transform and summarize data.metadata is used throughout the dw, from extracti
3、on stage through the access stage. metadata is used throughout the dw, from extraction stage through the access stage. metadatametadata metadata answers the following types of question: what information is available, by subject area, and when did we start collecting that data? how was this summariza
4、tion created? what queries are available to access the data? what business assumptions have been made? how do i find the data i need? how old is the data? what does that value mean? metadata metadata metadata can be classified into:technical metadata that contains information about data warehouse da
5、ta for use by data warehouse designers and administrators when carrying out data warehouse development and management tasks. business metadata contains information that gives users an easy-to-understand perspective of the information stored in the data warehouse.data warehouse operational informatio
6、n. operational informationoperational informationdata history (snapshots, versions); data ownership; data extract audit trail; data usage data;used by the load, management, and access processes for scheduling data loads or end user access. metadata usersmetadata userschoosing the metadata locationch
7、oosing the metadata location where it is stored is product-specific, the metadata resides in the database and usually on the data warehouse server. this is the preferred method. metadata may be located on a separate database on another machine. what is data modelingwhat is data modelingdata modeling
8、 has been an art that first gained recognition since dr. peter chens 1976 article which illustrated his new-found approach called entity-relationship modeling. since then it has become the standard approach used towards designing databases. by properly modeling an organizations data, the database de
9、signer can eliminate data redundancies which are a key source for in-accurate information and ineffective systems. why data modeling is importantwhy data modeling is importantvisualization of the business world: generally speaking, a model is an abstraction and reflection of the real world.the essen
10、ce of the database architecture: the data model plays the role of a guideline, or plan, to implement the database.data warehouse modelingdata warehouse modelinghow should the data warehouse databases be designed to best support the needs of the data warehouse users? answering that question is the ta
11、sk of the data modeler. data modeling is, by necessity, part of every data processing task, and data warehousing is no exception. three types of models three types of models in dw environmentin dw environmentit is important to understand the three types of models involved in the transformation proce
12、ss from the operational environment to a decision support system: the corporate data model the data warehouse data model the departmental data warehouse design the corporate data modelthe corporate data model the corporate data model is an enterprise-wide view of the data and its relationships. it n
13、ormally includes a high-level model which is an overview of each subject data area and the relationships between them, as well as logical data models for each subject data area. these models are the basis for developing both the enterprises online transaction processing (oltp) systems and data wareh
14、ouses. the corporate data model is a very good place to start the process of building a data warehouse. it provides a foundation for integration and unification at an intellectual level. the corporate data modelthe corporate data modelthe data warehouse data modelthe data warehouse data model the da
15、ta warehouse data model is sometimes referred to as an enterprise data warehouse model or data warehouse design. it represents an integrated, subject-oriented, and very granular base of strategic information which serves as a single source for the decision support environment. the data warehouse dat
16、a model maintains this integrated, detailed level of information so that all the departments and other internal organizations of the enterprise can benefit from a consistent, integrated source of decision support information. corporate data model to data corporate data model to data warehouse model
17、transformationwarehouse model transformation once the enterprise has a corporate data model, the transformation process into the data warehouse data model can begin:removal of purely operational dataaddition of an element of time to the key structure of the data warehouse if one is not already prese
18、ntaddition of appropriate derived datatransformation of data relationships into data artifactsaccommodating the different levels of granularity found in the data warehousemerging like data from different tables togetherremoving operational dataremoving operational data adding an element of time to t
19、he adding an element of time to the warehouse keywarehouse key adding derived dataadding derived data creating relationship artifactscreating relationship artifacts changing granularity of datachanging granularity of data merging tablesmerging tables the conditions are: the tables share a common key (or partial key) the data from the different
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 2026年武威市农村信用社联合社秋季校园招聘笔试备考题库(浓缩500题)及答案详解(各地真题)
- 2025年广东省江门市辅警考试真题及答案
- 本溪市农村信用社联合社秋季校园招聘笔试备考题库(浓缩500题)附答案详解(模拟题)
- 2026年莆田市农村信用社联合社秋季校园招聘笔试备考题库(浓缩500题)含答案详解(轻巧夺冠)
- 贵港市农村信用社联合社秋季校园招聘笔试备考题库(浓缩500题)及答案详解(名校卷)
- 2025年高校探访职业试题及答案
- 2025年高校辅导员招聘面试题集及参考答案
- 2026年邯郸市农村信用社联合社秋季校园招聘笔试备考题库(浓缩500题)有答案详解
- 衡阳市农村信用社联合社秋季校园招聘笔试备考题库(浓缩500题)含答案详解(综合卷)
- 2025年云南省特种作人员取证培训以及特种设备作业人员取证培训考试氟化工艺作业复习题库及答案
- 2024年执法资格考试题库(附答案)
- 2024年深圳市龙华建设发展集团有限公司招聘笔试冲刺题(带答案解析)
- 药师竞聘正高述职报告
- 公务员心理健康与调适讲座
- 昇兴(安徽)包装有限公司年产 18 亿只铝制两片罐项目环境影响评价报告书
- 企业电气安全事故案例分析
- 2023学年完整公开课版液压方枕器
- 2023年度环保管家服务招标文件
- 固定式人字抱杆整立施工作业指导书
- 犬胃切开术的课件资料
- 天津某钢厂高速线材主轧线设备安装方案年产万吨
评论
0/150
提交评论