US 2002/0066086 A1 Linden (43) Pub

US 2002O066086A1 (19) United States (12) Patent Application Publication (10) Pub. No.: US 2002/0066086 A1 Linden (43) Pub. Date: May 30, 2002 (54) DYNAMIC RECOMPLER (52) U.S. Cl. ............................................ 717/145; 717/151 (76) Inventor: Randal N. Linden, Los Angeles, CA (57) ABSTRACT (US) A method for dynamic recompilation of Source Software Correspondence Address: instructions for execution by a target processor, which LU & LIU LLP considers not only the Specific Source instructions, but also 811 WEST SEVENTH STREET, SUITE 1100 the intent and purpose of the instructions, to translate and LOS ANGELES, CA 90017 (US) optimize a set of equivalent code for the target processor. The dynamic recompiler determines what the Source opera (21) Appl. No.: 09/755,542 tion code is trying to accomplish and the optimum way of Filed: Jan. 5, 2001 doing it at the target processor, in an “interpolative' and (22) context Sensitive fashion. The Source instructions are pro Related U.S. Application Data cessed in blocks of varying Sizes by the dynamic recompiler, which considers the instructions that come before and after (63) Non-provisional of provisional application No. a current instruction to determine the most efficient approach 60/175,008, filed on Jan. 7, 2000. out of Several available approaches for encoding the opera tion code for the target processor to perform the equivalent Publication Classification tasks Specified by the Source instructions. The dynamic compiler comprises a decoding Stage, an optimization Stage (51) Int. Cl. ................................................. G06F 9/45 and an encoding Stage. Patent Application Publication May 30, 2002 Sheet 1 of 3 US 2002/0066086 A1 Patent Application Publication May 30, 2002 Sheet 2 of 3 US 2002/0066086 A1 - \O, 2 Patent Application Publication May 30, 2002 Sheet 3 of 3 US 2002/0066086 A1 fo, 3 US 2002/0066086 A1 May 30, 2002 DYNAMIC RECOMPLER more target instructions to be executed in response to a given Source instruction. Thus, the given computer can execute RELATED APPLICATION target instructions of its own machine, and through emula tion execute Source instructions. 0001. This is a continuation of Provisional Application No. 60/175,008, filed on Jan. 7, 2000. 0009. Two main types of emulation strategies currently are available in the emulation field. The first strategy is BACKGROUND OF THE INVENTION known as “interpretation', in which each Source instruction is decoded in turn as it is addressed, causing a Small 0002) 1. Field of the Invention Sequence of target instructions then to be executed that emulate the Source instruction. The main component of an 0003. The present invention relates to a method and emulator is typically an interpreter that converts each apparatus for emulating Source instructions on a target instruction of any program in machine language A into a Set information processing System and, more particularly, to a of instructions in machine language B, where machine Scheme for translating Source instructions to target instruc language B is the code language of the target computer on tions executable on the target information processing Sys which the emulator is being used. In Some instances, inter tem. preters have been implemented in computer hardware or 0004 2. Description of Related Art firmware, thereby enabling relatively fast execution of the 0005 Information processing system or computing sys emulated programs. tems comes in many configurations. They include without 0010. The overhead, in terms of speed, for decoding each limitation the generally known computers (Such as personal Source instruction each time the Source instruction is called computers), personal digital assistant, Video game consoles, is relatively Small, but present for each instruction. Conse application specific control Systems, and other Systems and quently, a given Source instruction that is addressed and, devices that incorporate a processing unit. For example, hence, decoded, many times in the course of running a computers have general-purpose central processing units program will slow the execution time of the overall program (CPU) which are designed to execute a specific set of being run, i.e., Overhead will increase. instructions. 0011. The other main emulation strategy is known as 0006 A CPU of one family, such as the Motorola 680XO “translation', in which the Source instructions are analyzed family of processors, manufactured by Motorola, Inc., Phoe and decoded. This is also referred to as “recompilation” or nix, Ariz., executes instructions unique to this family, a CPU “cross-compilation”. It is well known that the execution of another family, such as the Intel 80X86 manufactured by Speed of computer programs is dramatically reduced by Intel Corp., Sunnyvale, Calif., executes instructions unique interpreters. It is not uncommon for a computer program to to that family, and a PowerPC processor family, used in a run ten to twenty times Slower when it is executed via Macintosh computer manufactured by Apple Computer, emulation than when the equivalent program is recompiled Inc., Cupertino, Calif., executes instructions Specific to that into target machine code and the target code version is family. These instructions comprise part of the operating executed. Due to the well known slowness of Software Systems of the respective computers in which the respective emulation, a number of products have Successfully CPUS reside. Typically, application Software programs are improved on the Speed of executing Source applications by then written to run on the respective operating Systems. dynamically cross-compiling portions of Such program at run time into target machine code, and then executing the 0007 As another example, video game manufacturers, recompiled program portions. While the cross-compilation Such as Sony, developed a game player based on a propri process typically takes 50 to 100 machine or clock cycles per etary hardware (processing) platform that runs only game instruction of the Source, the greater Speed of the resulting Software in a CD format which is specifically developed for target machine code is, on average, enough to improve the its platform. Sony game Software would not run on other hardware platforms. Game Software based on other plat overall Speed of execution of most Source applications. forms would not run on Sony players. 0012. The primary reason that overall execution speed is improved by cross-compilation is that most programs con 0008. A computer manufacturer will design a computer tain execution loops of instructions that are repeatedly system with a given CPU and will want to maximize market executed hundreds, thousands, or even millions of times penetration by having more rather than fewer applications during a typical execution of the program. The Source Software being able to run on its System. For example, as instructions are analyzed and decoded only once, i.e., the indicated above, Apple Computer manufactures the Pow first time they are addressed, and the target instruction erPC-based Macintosh line of computers. Applications soft Stream is generated and Stored in memory, usually a RAM ware that has been written to run on operating Systems for memory that may be a cache memory. By avoiding repeated the Motorola 680XO family, for example, may not all run on the PowerPC-based machines. Accordingly, a field of tech interpretation of the instructions in Such loops, Substantial nology has developed in which a given computer having one execution time is Saved. Consequently, Subsequent emula type of CPU, called a target, will include an emulator that tion of the same Source instruction may be performed allows the target computer to emulate the instructions, called quickly because the decoding overhead is nonexistent. the source, of another type of CPU. Thus, the target com 0013 While run time cross-compilation of source appli puter will have Stored in memory Source instructions that cations is well known to those skilled in the art, there are may be called in response to applications Software, target Several areas in which existing croSS-compilation Systems instructions emulating the Source instructions and execut have fallen short of their potential. For example, it is able by the target CPU, and an emulator that causes one or necessary to have a relatively large buffer or cache memory US 2002/0066086 A1 May 30, 2002 in the target computer. A block of memory in the target 0018. In one embodiment of the present invention, the computer's memory address Space is Set aside to Store the dynamic compiler comprises three stages: (1) a decoding target instruction Stream generated during the initial trans Stage for decoding the Source instructions and parameters, lation process. If this block is large enough to contain the creating an instruction Stream that is optimized based on the entire translated target instructions, emulation will proceed Source instructions and parameters; (2) an optimization stage at the maximum rate Since, in response to a given Source for optimizing the flow of information and related operation instruction, the RAM memory can be quickly addressed to code based on the characteristics of the target processor, and access the corresponding Sequence of target instructions for (3) an encoding stage for encoding instructions specifically execution. Otherwise, there is translation overhead neces for the target processor to achieve the intended results, Sary to handle translation of the program in Small Sections including

US 2002/0066086 A1 Linden (43) Pub

A Dynamically Recompiling ARM Emulator POWERED

A Brief History of Just-In-Time Compilation

Program Dynamic Analysis Overview

Transparent Dynamic Optimization: the Design and Implementation of Dynamo

Infrastructures and Compilation Strategies for the Performance of Computing Systems Erven Rohou

Costing JIT Traces

Arxiv:2106.08833V1 [Cs.NI] 16 Jun 2021 Mented on Commodity Servers, Are Widely Adopted in Real Tion Deployments

A Dynamic Optimization Framework for a Java Just-In-Time Compiler

Virtual Machines Uses for Virtual Machines

Improving Whole Program Code Locality in Managed Runtimes

Adaptive Optimization in the Jalape ˜No

Continuous Program Optimization Via Advanced Dynamic Compilation Techniques