分布式DBMS的概念与设计 计算机等专业毕业设计(论文)外文翻译.doc_第1页
分布式DBMS的概念与设计 计算机等专业毕业设计(论文)外文翻译.doc_第2页
分布式DBMS的概念与设计 计算机等专业毕业设计(论文)外文翻译.doc_第3页
分布式DBMS的概念与设计 计算机等专业毕业设计(论文)外文翻译.doc_第4页
分布式DBMS的概念与设计 计算机等专业毕业设计(论文)外文翻译.doc_第5页
已阅读5页,还剩4页未读 继续免费阅读

VIP免费下载

版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领

文档简介

大连交通大学2008届本科生毕业设计(论文)外文翻译The Concepts and Design of Distributed DBMS1. INTRODUCTION A major behind the development of database systems is the desire to integrate the operational data of an organization and to provide controlled access to the data. Although integration and controlled access may imply centralization, this is not the intention. In fact, the development of computer networks promotes a decentralized mode of work. This decentralized approach mirrors the organizational structure of many companies, which are logically distributed into divisions, departments, projects, and so on, and physically distributed into offices, plants, factories, where each nit maintains its own operational data. The shareability of the data and the efficiency of data access should be improved by the development of a distributed database system that reflects this organizational structure, makes the data in all units accessible, and stores data proximate to the location where it is most frequently used. Distributed DBMSs should help resolve the islands of information problem. Databases are sometimes regarded, as electronic islands that are distinct and generally inaccessible places, like remote islands. This may be a result of geographical separation, incompatible computer architectures, incompatible communication protocols, and so on. Integrating the databases into a logical whole may prevent this way of thinking. 2Concepts To start the discussion of distributed DBMSs, we first give a definition of a distributed database. Distributed database: a logically interrelated collection of shared data physically distributed over a computer network. Following on from this we have the definition of distributed DBMS. Distributed DBMS: the software system that permits the management of the distributed database and makes the distribution transparent to users.A distributed database management system consists of a single logical database that is split into a number of fragments. Each fragment is stored on one or more computers under the control of a separate DBMS, with the computers connected by a communications network. Each site is capable of independently processing user requests that require access to local data and is also capable of processing data stored on other computers in the network.Users access the distributed database via application. Applications are classified as those that do not require data from other sites and those that do require data from other sites. We require a DBMSs to have at least one global application. A DDBMS therefore has the following characteristics:A collection of logically related shared data;The data is split into a number of fragments;Fragments may be replicated;Fragments/replicas are allocated to sites;The sites are linked by a communications network;The data at each site is under the control of a DBMS;The DBMS at each site can handle local applications, autonomously;Each DBMS participates in at least one global application;From the definition of the DDBMS, the system is expected to make the distribution transparent to the user. Thus, the fact that a distributed database is split tinto fragments that can be stored on different computers and perhaps replicated, should be hidden from the user. The objective of transparency is to make the distributed system appear like a centralized system. This is sometimes referred to as the fundamental principle of distributed DBMSs.Advantages and Disadvantages of DDBMSsThe distribution of data and applications has potential advantages over traditional centralized database systems. Unfortunately, there are also disadvantages. In this section, we review the advantages and disadvantages of the DDBMS.AdvantagesReflects organizational structureMany organizations are naturally distributed over several locations. For example, DreamHome has many officers in different cities. It is natural for databases used in such an application to be distributed over these locations. DreamHome may keep a database at each branch office containing details of such things as the staff who work at that location, the properties that are for rent, and the clients whoown or wish to rent out these properties. The staff at a branch office will make local inquiries of the databases. The company headquarters may wish to make global inquiries involving the access of data at all or a number of branches.Improved shareability and local autonomyThe geographical distribution of an organization can be reflected in the distribution of the data; users at one site can access data stored at other sties. Data can be placed at the site close to the users who normally use that data. In this way, users have local control of the data, and they can consequently establish and enforce local policies regarding the use of this data. A global database administrator is responsible for the entire system. Generally, part of this responsibility is devolved to the local level, so that the local DBA can manage the local DBMS.Improved availabilityIn a centralized DBMS, a computer failure terminates the operations of the DBMS. However, a failure at one site of a DBMS, or a failure of a communication link making some sites inaccessible, does not make the entire system inoperable. Distributed DBMSs are designed to continue to function despite such failures. If a single node fails, the system may be able to reroute the failed nodes requests to another site.Improved reliabilityAs data may be replicated so that it exists at more than one site, the failure of a node or a communication link does not necessarily make the data inaccessible.Improved performance As the data is located near the site of greatest demand, and given the inherent parallelism of distributed DBMSs, speed of database access may be better than that achievable form a remote centralized database. Furthermore, since each site handles only a part of the entire database, there may not be the same contention for CPU and I/O services as characterized by a centralized DBMS.EconomicsIn the 1960s,computing power was calculated according to the square of the costs of the equipment: three times the cost would provide nine times the power. This was known as Grouchs Law. However, it is now generally accepted that it costs much less to create a system of smaller computers with the equivalent power of a single large computer. This makes it more cost-effective for corporate divisions and departments to obtain separate computers. It is also much more cost-effective to add workstations to a network than to update a mainframe system.The second potential cost saving occurs where databases are geographically remote and the applications require access to distributed data. In such cases, owing to the relative expense of data being transmitted across the network as opposed to the cost of local access, it may be much more economical to partition the application and perform the processing locally at each site.Modular growthIn a distributed environment, it is much easier to handle expansion. New sites can be added to the network without affecting the operations of other sites. This flexibility allows an organization to expand relatively easily. Increasing database size can usually be handled by adding processing and storage power to the network. In a centralized DBMS, a growth may entail changes to both hardware and software.ComplexityA distributed DBMS that hides the distributed nature form the user and provides an acceptable level of performance, reliability is inherently more complex than a centralized DBMS. The fact that data replication adequately, there will be degradation in availability, reliability, and performance compared with the centralized system, and the advantages we cited above will become disvantages.CostIncreased complexity means that we can expect the procurement and maintenance costs for a DDBMS to be higher than those for a centralized DBMS. Furthermore, a distributed DBMS requires additional hardware to establish a network between sites. These are ongoing communication costs incurred with the use of this network. There are also additional labor costs to manage and maintain the local DBMSs and the underlying network.SecurityIn a centralized system, access to the data can be easily controlled. However, in a distributed DBMS not only does access to replicated data have to be controlled in multiple locations, but the network itself has to be made secure. In the past, networks were regarded as an insecure communication medium. Although this is still partially true, significant developments have been made to make network more secure.Integrity control more difficultDatabase integrity refers to the validity and consistency of stored data. Integrity is usually expressed in terms of constraints, which are consistency rules that the database is not permitted to violate. Enforcing integrity constraints generally requires access to a large amount of data that defines the constraint but which is not involved in the actual update operation itself. In a distributed DBMS, the communication and processing costs that are required to enforce integrity constraints may be prohibitive. We return to this problem in Section Lack of standards Although distributed DBMSs depend on effective communication, we are only now starting to see the appearance of standard communication and data access protocols. This lack of standards has significantly limited the potential of distributed DBMSs. There are also no tools or methodologies to help users convert a centralized DBMS into a distributed DBMS.Lack of experienceGeneral-purpose distributed DBMSs have not been widely accepted, although many of the protocols and problems are well understood. Consequently, we do not yet have the same level of experience in industry as we have with centralized DBMSs, For a prospective adopter of this technology, this may be a significant deterrent.Database design more complexBesides the normal difficulties of designing a centralized database, the design of a distributed database has to take account of fragmentation of data, allocation of fragments to specific sites, and data replication. 分布式DBMS的概念与设计1 介绍推动数据库系统发展的一个主要因素是,人们希望将一个企业的操作数据综合起来并能提供对数据的受控访问,经管综合和受控访问意味着集中管理,但这并不是目的。实际上,计算机网络的发展促进了分散式的作业模式。这种分散的方式反映了许多公司的组织结果:在逻辑上分成多个分公司,部门,项目等,而在体制上分为办公室,车间,工厂,每一个单元都维护着自己的操作数据。数据的共享和数据访问效率的提高以来与分布式数据库系统的发展,分布式数据库反映了上述组织结构,使得每个单元的数据都是可访问的,并将最近的数据存放于最常用的位置。分布式DBMS有助于结局信息孤岛的问题,有时,数据库被看做是孤立的,不可访问的电子岛,就像遥远的岛屿一样。这可能是由于地理分割,计算机体系结构不兼容或通信协议不兼容原因造成的。把数据库综合成一个逻辑整体可能会改变这种思考的方式。2. 概念在开始讨论分布式DBMS之前,先给出分布式数据库的定义。分布式数据库:物理上分布于计算机网络中,但在逻辑上相关的共享的集合。由此可以得到分布式DBMS的定义。分布式DBMS:管理分布式数据库并使分布性对用户透明的软件系统。分布式数据库管理系统是由一个被氛围多段的逻辑数据库构成的。每个段在独立的DBMS的控制下,可以存储在一个或多个通过通信网络互联起来的计算机上。每一个结点都可以独立的处理用户访问本地数据的请求,并且也可以处理网络上其他计算机存储的数据。用户是通过听用来访问分布市数据库的,应用又可氛围不需要从其他结点获得数据的应用和确实需要从其他结点获得数据的应用,一般要求DDBMS至少包含一个全局应用。因此,DDBMS应当具有如下特征:l 逻辑上相关的共享数据的集合。l 数据是分段的。l 段是可以复制的。l 段/副本是分配在各个结点上的。l 结点是用通信网络连接起来的。l 每个结点的数据都由DBMS控制,l 每个结点的DBMS都能自主地处理本地应用。l 每个DBMS至少参与一个全局应用。从DDBMS的定义中可以看出,系统期望分布性对于用户来讲是透明的。这样,用户就不需要知道分布式数据库是分段的,存储在多个不同计算机上,并且可能被复制等细节。透明性的目的就是要使用户使用分布式系统如同使用集中式系统一样。这常常被称为DDBMS的基本原则。这个要求为终端用户提供了强大的功能。DDBMS的优缺点数据分布以及应用分布比传统的集中式数据库具有潜在的优势,但也存在着不足。优点:反映了组织结构许多组织都是自然的分布与各个地方。例如,DreamHome在许多城市都设有分支机构。于是该应用的数据库将很自然的分布于这些不同的地方。DreamHome的每个分支机构都有一个数据库用来记录该机构的职员信息,出租房产信息以及房产所有者的信息。本地的职员可以在本地数据库上进行本地查询,而公司的高层则可以访问所有分支机构的任何数据,进行全局查询。改进了共享性和本地自主权数据的分布可以反映出一个组织在地理上的分布,一个结点的用户可以访问其他结点上的数据。数据会存放在靠近经常是同这些数据的用户的结点上。这样,拥护就可以对数据拥有本地控制权,从而可以建立和执行关于使用这些数据的本地策略。全局数据管理员(DBA)对整个系统负责。通常也可以把责任部分地下放到本地级,所以本地DBA可以管理本地DBMS。改进了可用性对于集中式DBMS,计算机的一次故障会中断所有的DBMS操作。然而,在DDBMS系统中,一个结点的故障或通信链路的故障只会使某些结点不能被访问,但决不会中断整个系统的操作。分布式DBMS就是设计用来使系统在这些故障发生时仍然可以继续工作。即使一个结点出现故障,系统也可以把对故障结点的访问请求重定向到其他结点。提高了可靠性因为数据可以被复制而存在与多个结点之上,所以一个结点的鼓掌或通信链路故障不会妨碍对该数据的访问。改进了性能因为数据是防止在离“最大需求“最近的结点上的,而且由于分布式DBMS内在的并行机制,所以访问分布式数据库的速度肯定比访问远程集中式

温馨提示

  • 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
  • 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
  • 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
  • 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
  • 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
  • 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
  • 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。

评论

0/150

提交评论