




免费预览已结束,剩余7页可下载查看
下载本文档
版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
Computer Communications 23 (2000) 15941605/locate/comcomOn object initialization in the Java bytecodeqS. Doyon*, M. DebbabiLSFM Research Group, Department of Computer Science, Laval University, Sainte Foy, Que., Canada G1K 7P4AbstractJava is an ideal platform for implementing mobile code systems, not only because of its portability but also because it is designed with security in mind. Untrusted Java programs can be statically analyzed and validated. The programs behavior is then monitored to prevent potentially malicious operations. Static analysis of untrusted classes is carried out by a component of the Java virtual machine called the verier. The most complex part of the verication process is the dataow analysis, which is performed on each method in order to ensure type-safety. This paper claries in detail one of the tricky aspects of the dataow analysis: the verication of object initialization. We present and explain the rules that need to be enforced and we then show how verier implementations can enforce them. Rules for object creation require, among other things, that uninitialized objects never be used before they are initialized. Constructors must properly initialize their this argument before they are allowed to return. This paper also deals with initialization failures (indicated by exceptions): the object being initialized must be discarded, and constructors must propagate initialization failures. q 2000 Elsevier Science B.V. All rights reserved.Keywords: Java bytecode; Object initialization; Dataow analysis; static analysis; java security1. IntroductionThe Java architecture is particularly well-suited for implementing mobile code systems. A mobile code archi-tecture allows a computer to fetch a program (or parts of a program) from a network source and execute it locally. However, security is a critical aspect of mobile code archi-tectures. The very essence of mobile code is to execute a program that originates from a remote source. This is inher-ently dangerous because it is not known what actions that program will take. By executing the mobile code, we are allowing it to perform operations on our machine and we are giving it access to our local resources.Java is especially well-suited for implementing mobile code systems for three reasons: Java source is compiled into a platform-independent intermediate form called Java bytecode. Java byte-code is then interpreted by the JVM (Java virtual machine). This makes Java bytecode completely portable, which means a piece of Java code in compiled form should run on any receiving machine.q The research reported in this paper has been supported by the National Science and Engineering Research Council (NSERC), the Fonds pour la formation de chercheurs et laide a la recherche (FCAR), and the Defense Research Establishment Valcartier (DREV), Department of National Defense.* Corresponding author. Tel.: _1-41-8656-7035; fax: _1-41-8656-2324.E-mail address: doyonift.ulaval.ca (S. Doyon). It is dynamically linked: the JVM will load classes from different network sources as they are needed and will link them into the program while it runs. The Java architecture is built with security in mind: its design makes it possible to enforce sufcient security to make mobile code safe and practical.Currently, the most popular manifestation of Java mobile code is applets. A JVM (bytecode interpreter) is incor-porated in web browsers. Web pages can then include links that point to the compiled (bytecode) form of programs which are called applets. The applet can then be loaded by the browser and executed locally with no special effort on the users part.The verier is a key component of the Java security archi-tecture. Its role is to examine compiled classes as they are loaded into the JVM in order to ensure that they are well-formed and valid. It checks that the code respects the syntax of the bytecode language and that it respects the language rules. Another component of the Java security architecture, called the security manager, monitors access to system resources and services. The security manager is a security layer, which goes on top of the verier and relies on its effectiveness.The most complex step of the verication process performed by the verier requires running a dataow analy-sis on the body of each method. There are a few particularly tricky issues regarding the dataow analysis. In this paper, we focus on the issues relating to the initialization of0140-3664/00/$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.PII: S 0 1 4 0 - 3 6 6 4 ( 0 0 ) 0 0 2 4 5 - 0S. Doyon, M. Debbabi / Computer Communications 23 (2000) 159416051595new objects: Issues relating to object creation: A new object is createdin two steps: space is allocated for the new object, and then it is initialized. When performing the dataow analysis, the verier must ensure that certain rules are respected: the constructor used to initialize an object must be appropriate, an object must not be used before it is initialized, an object must not be initialized more than once and initialization failures (indicated by exceptions) must be handled properly. Issues relating to constructors: The constructor is respon-sible for initializing a new object. The rst part of the constructors work performs initialization from a typing point of view, which implies directly or indirectly calling a constructor from the superclass. The rest of the constructor performs application-specic initialization. The verier must ensure that a constructor properly initi-alizes the current object before it returns, that it does not use the current object in any way before calling the super-class constructor and that it propagates any initialization failure occurring in the superclass constructor.The Ofcial documentation on the verier, provided in (Ref. 1, Sections 4.8 and 4.9) and in Ref. 2, is relatively sparse; the portions discussing object initialization are very brief, vague, and leave out some important issues. Indepen-dent work presented in Ref. 3 has claried many aspects. Freund and Mitchell have extended the formalization of a subset of the Java bytecode language introduced in Ref. 4. They used a type system to describe the veriers handling of object initialization. Our paper reviews and explains the rules related to object initialization and discusses how a verier implementation can enforce them. We also touch on a few issues not discussed in Ref. 3. Exceptions thrown during object initialization indicate initialization failures and must be handled properly, both inside and outside of a constructor. We also provide a comprehensive, intuitive explanation of how the rules for object creation can be enforced with minimal effort.We assume that the reader has some knowledge of the Java bytecode language, as well as a basic understanding either of dataow analysis in general or of the particular analysis technique used by the Java bytecode verier. The unfamiliar reader may consult the following references for more complete information: for the Java language the reader may refer to the ofcial specication of the language 5. The best way to learn Java or to nd a more understandable explanation of its concepts is to read Ref. 6. For details on the Java standard library, see Ref. 7. The workings of the JVM and the bytecode instruction set are described in the ofcial JVM specication 1. For a lighter approach, see Ref. 8. To gain a good understanding of the Java bytecode language, it is necessary to experiment with it. Two tools are essential: a class le disassembler, that will print out a class le (and in particular the bytecode) in a readable format.Suns javap tool, which comes with the JDK can be used for this, although other alternatives are available. A byte-code assembler, that produces class les from some source with a manageable syntax. Otherwise, constructing binary class les by hand would be difcult and time consuming. A great solution is the excellent jasmin 9.This paper is organized as follows. Section 2 provides a brief overview of the dataow analysis in order to show the context in which verication of object initialization occurs. Section 3 deals with the creation of new objects, while Section 4 explains the special requirements imposed on constructors. Each of these sections rst presents the neces-sary rules that the verier must somehow enforce, and then discusses how an implementation could achieve the desired result. Section 5 shows that constructors may leak or save a copy of their this reference, which means that it is possible for incompletely initialized objects to be actually used. Section 6 lists some of the related work. Some concluding remarks are ultimately sketched as a conclusion in Section 7.2. Dataow analysisThe Java bytecode verier ensures that the classes loaded by the JVM do not compromise the security of the system, either through disrespect of the language rules or through compromise of the integrity of the virtual machine. The verier validates many syntactical aspects of the class le. It validates eld and method declarations. It makes some checks relating to the superclass. It veries references to other classes, other methods and elds and it enforces access restriction mechanisms (like protected, private and nal). The body of each method is examined in turn: each byte-code instruction and its operands are validated.The most complex yet most interesting part of the veri-cation process is the dataow analysis. It is performed inde-pendently on each method. The dataow analysis checks that each bytecode instruction gets arguments of the proper type (from the stack or from the registers), detects and prevent overows and underows of the expression evaluation stack and ensures that subroutines are used consistently. The dataow analysis also must check that object initialization is performed correctly. This paper will attempt to clarify the properties that need to be enforced on object creation and constructors. We will also propose ways in which a verier implementation can enforce those rules.In order to perform the dataow analysis, it is necessary to keep track of the type of each value on the stack and in the registers at each program point. We will assume that each instruction of a method constitutes a program point, although it is possible to use fundamental blocks of instruc-tions as program points. The type, which is recorded by the dataow analysis for a given location at a given program point must be consistent, irrespective of the execution path used to reach that program point. When there is a conict1596S. Doyon, M. Debbabi / Computer Communications 23 (2000) 15941605because two or more paths would yield different types of values for the same location, then we record for that location a common supertype of all the types that could actually occur. For instance, if at a given program point a certain loca-tion could contain either an instance of FileInputStream or an instance of ByteArrayInputStream, the dataow analysis merges the two types and records the type Input-Stream instead. If there are no common supertypes for the possible types in a certain location, then the type unusable is used, indicating that the value cannot be used by the following instructions. This generalization of types does imply a loss of information and precision. This is what makes the analysis conservative, in the sense that it is pessimistic.Types used in the dataow analysis are primitive types (single-word int or oat or double-word long or double) and reference types (the types associated to references to objects or arrays). A reference type may be a class, interface or array type (which species a base type and a number of dimensions). The type returnAddress will be used to describe the return address to a subroutine, as created by the jsr instruction. The special type named unusable is used to mark uninitialized registers. The special reference type null is used to represent the type of null references produced by the aconst_null instruction. Also note that implementations will generally use other special types to represent allocated but not yet initialized objects.3. Object creationCreating a new object is done in two steps. First, space for the object is allocated through the use of the new instruction, which returns a reference that points to the newly allocated memory space. Then, the object is initialized by invoking one of its constructors (a method named kinitl).For example, the Java statementnew String()is translated to the following bytecode instructions:; allocate space for String and push; reference to it onto the stacknew java/lang/String; duplicate top stack item (reference to; newly allocated space)dup; call String.String() constructor, uses; up one of the references to newly allocated; space as this argument.invokespecial java/lang/String/ kinitl()V; This leaves a reference to the new; String object on the stack.The constructor is responsible for putting the object in a valid state. Until initialization of the new object completes, its state remains undened and may be inconsistent. The language semantics therefore disallows using a newly allo-cated object before it is initialized. Enforcing this is one of the veriers responsibilities. The verier must keep track of which object is initialized and which is not, ensure that proper constructors are used to initialize new objects and make sure that uninitialized objects are not used before they are initia-lized. This is one of the tricky points of the dataow analysis. Ref. 1 covers this aspect briey. Ref. 3 presents a detailed analysis and formal specication of the language rules related to object initialization. Unfortunately, neither Refs. 3 nor 1 discuss the interaction between object initialization and exception handlers. We will rst discuss the rules that the verier should enforce, and we will then consider how a verier implementation can enforce them.3.1. RulesThe verier must enforce the following properties: An object must not be used before it is initialized. An uninitialized object must be initialized by one of the constructors declared in its class. A constructor fromanother class cannot be used. Notice that methods named kinitl are not inherited. An object must not be initialized more than once. If an exception is thrown by the call to the instance initialization method, then the new object must not be used because its initialization is incomplete.We rst discuss what it means for an uninitialized object (or rather a reference to it) to be used. The reference pushed onto the stack by the new instruction should be considered to have a special type, indicating that the object it points to is not initialized. The verier must allow moving and copying the reference on the stack and into registers. Any other use of the reference must be disallowed. To be precise:Copying the reference to and from registers using aload and astore is permitted.Moving the reference around on the stack through swap, pop and its variants is permitted. Duplicating the refer-ence through dup and its variants is also allowed.Putting the reference in an object or a class eld through puteld or putstatic is not allowed. Accessing elds of the uninitialized object itself (through geteld or puteld) is not allowed either. This means that the new reference is unacceptable as either of the two arguments of puteld.The reference must not be passed as a parameter to a method or used to designate the object on which a method is called. It is therefore disallowed as any of the parameters of invokevirtual, invokespecial and invokeinterface, except of course that anS. Doyon, M. Debbabi / Computer Communications 23 (2000) 159416051597kinitl methods can be invoked on it by invokespe-cial.The reference may not be thrown as an exception by athrow.The current method may not return the new reference through areturn.The new reference may not be stored into an array through aastore.The references type may not be checked through checkcast or instanceof.The monitor of the new object may not be accessed through monitorenter or monitorexit.A newly allocated object can be initialized by calling one of its constructors (instance initialization methods, named kinitl). Only the invokespecial instruction may be used to invoke such methods. When the constructor returns, the object is considered to have been properly initialized. Classes may provide several constructors (methods named kinitl) with different signatures. There is no restriction as to which constructor should be called. In fact, the class being instantiated might not have been linked yet and the verier might not even know which constructors are available: exis-tence of the constructor will be checked during resolution, in the same way as any other method invocation.Invoking a method named kinitl is a special case for invokespecial. The verier should validate the para-meters being passed to the method as it would normally. The reference indicating on which object the method is being invoked should be a reference to an uninitialized object of the proper type: that is, a reference to an uninitialized object of the same type as the class from which the kinitl method is taken. Suppose a class named C is being instantiated. An instruction of the formne
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 2025版智慧城市投资入股合同示范文本
- 2025年度预付款担保公司针对艺术品交易合同
- 2025殡葬服务绿色殡葬理念推广与应用合同
- 2025年度文化旅游区特色商铺租赁合作协议
- 2025年度高端企业市场分析及战略规划商务咨询服务合同
- 2025年度地铁隧道建设第三方监理合同范本
- 2025版体育馆室内篮球场租赁及配套器材供应合同
- 2025年特色民宿改造升级设计与施工合同
- 2025年度能源行业财务担保合同发行与会计处理规范
- 2025版新型生态绿化工程承包合同下载
- 区域检验信息管理系统规划书
- 冷库安全基本知识培训课件
- 澄海玩具行业出口中存在的问题及对策分析
- 工业园区集中供热配套建设项目可行性研究报告
- 2025年应急管理普法知识竞赛题(附答案)
- 2024-2030全球飞机拆解再制造行业调研及趋势分析报告
- 数控铣床实训安全教育
- 物业前台倒水服务方案(3篇)
- 2025年职业指导师考试试卷:职业指导师职业指导
- 常减压装置仿真操作正常停车石油炼制装置操作02课件
- 2025年科技创新企业财务工作总结及计划
评论
0/150
提交评论