A Bibliography of Publications on Visual Instruction Sets

Nelson H. F. Beebe University of Utah Department of Mathematics, 110 LCB 155 S 1400 E RM 233 Salt Lake City, UT 84112-0090 USA Tel: +1 801 581 5254 FAX: +1 801 581 4148 E-mail: [email protected], [email protected], [email protected], [email protected] (Internet) WWW URL: http://www.math.utah.edu/~beebe/ 12 December 2019 Version 2.16

Title word cross-reference Mot98, Mot99b, Mot99a, Mot99c, Phi98]. Alto [IEE98]. AMD [Adv00b, Hal97, IM99, OWJF98, OFW99, Pou98]. AMD-K62-E 2 [IM99, Lee96, OWJF98]. 2.0 [LHxx]. [Adv00b]. Analysis [IM99, SB01]. Apple [Fit01]. Application [Hun00, PW96]. 3 [Hal97]. 3-D [Hal97]. 3DNow [OWJF98, Architecture [Hsu01, LHxx, MPW97, OFW99, Adv00a, BN99, IM99, Fom00]. OFW99, PW96, Phi98, TO96]. architectures [Ano96a]. Aspects [Hsu01]. 64-bit [LHxx]. Assembly [Sun00]. Auditorium [IEE98]. August [IEE96, IEE97, IEE98]. 7100LC Automatically [BGGT01]. [Ano95, BBJ+95, BKQW95, LBLS95, Smi94]. 712/60 [Smi94]. 7300LC [BT97]. based [BM00, Hun00]. Better [Ano97a]. Beyond [Hal97]. bit [LHxx]. Black [Ben95]. accelerate [TONH96]. Accelerates Blazing [Tho98]. Blinn [Bli97]. [DDHS00]. Accelerating breakthroughs [Smi94]. Briefs [GL97]. [DDC+98, Lee96, Sun96a]. achieve [Smi94]. Bugs [Ben95]. builds [Pou98]. Built Across [DDC+98]. algorithms [TONH96]. [Pea95]. Built-In [Pea95]. Business Alpha [RRM96]. Alternatives [Ano98]. [BEM95, GL97]. BYTE [Ano96a, Ano96b]. AltiVec [DDC+98, DDHS00, Fit01, Ful98,

1 2

California [IEE96, IEE97, IEE98]. free [Ano96a]. Fugue [Bli97]. Functions Capabilities [Pea95]. Case [Sun96a]. [BKQW95, SB01]. Centaur [Hal97]. challenge [Hal97]. Chip G4 [Fit01, Hun00]. Gaining [GL97]. gating [Sun96b, BT97, Pou98, TO96]. Chips [BM00]. Generation [Mah96, Phi98]. [Ano97b, IEE97, Hal97, IEE96, IEE98]. Graphics [Fom00, Pea95]. clock [BM00]. Code [Fom99]. Coding [Hsu01]. Combining [TO96]. commerce Hardware [Hsu01, Smi94]. High [Ano96b]. communications [PW96]. [AJ97, BEM95, Mod97, Pea95, Smi94, TO96]. Competition [GL97]. Competitive High-Performance [BEM95, Pea95, TO96]. [BKQW95]. Computational [Hun00]. highly [BT97]. hopped [Ano97a]. Hot Computer [Hsu01]. Computing [IEE96, IEE97, IEE98]. HP [Smi94]. [Hun00, JC98]. conference [IEE98]. Considerations [Wei96]. Contrast [Ben95]. IC [BKQW95, Ben95]. II [Mod97, KGOL97]. conventional [TONH96]. Converging Imaging [Ben95]. Implementation [GL97]. Core [Sun96a, Ano96b]. Corner [AJ97, Ano95, IM99, Kag96]. [Bli97]. Cost [BEM95, Pea95, Smi94]. CPU Implementations [OFW99]. Implicit [Mod97, Smi94]. CPUs [Tho98]. Cutting [BGGT01]. Improving [PW96, BM00]. [BN99]. Cyrix [Hal97]. INBOX [Ano96b]. Information [GL97]. innovative [Smi94]. Instruction D [Hal97]. Data [Adv00b]. Decisions [Mah96, RRM96, Sun96a, Sun96b, Sun02, [BKQW95]. Decoder [AJ97, LBLS95]. Lee96, TONH96]. Instructions Design [BBJ+95, BKQW95]. Desktop [GV06, IM99, AB01, TO96]. integrated [Ano97c]. Development [BEM95]. Digital [BT97]. [AJ97, GL97]. Dilemma [GL97]. Does [AB01, Ano96b, Ano97a, GL97, Hal97, KB96, [Tho98]. Dolby [AJ97]. Dynamic PW96, PWW97, Ram99, SB01, Wei96]. [Mod97, BM00]. Dynamics [Hun00]. Interface [Mot99a]. Internet [Ano96b, TH99]. issue [Ano97d, TO96]. Edge [BN99]. Effectively [BN99]. efficient issues [Ano96a]. IX [IEE97]. [Lee96]. electric [Ano96b]. Embedded [Adv00b]. Emmerald [AB01]. Enhanced Jerry [Pou98]. Jim [Bli97]. Just [Ano97b]. [Pou98, LBLS95]. Enhancements [Ano95]. Enhancing [TONH96]. Environment K6 [Pou98, IM99, OWJF98]. K6-2 [BKQW95]. Environments [Mot98]. [IM99, OWJF98]. K62 [Adv00b]. Key Evaluation [Hun00]. Examples [Mot99b]. [GV06]. Execution [Mod97]. Exploiting [BGGT01]. Extension [DDHS00, PW96]. Extensions Language [Sun00]. Levels [Ben95]. Linux [LHxx, Mah96, RRM96, JC98, Lee96, TH99]. [BN99]. Logic [Ben95]. Look [Fit01, Ano97c]. Low Fast [Ano97b, AB01]. First [Fit01, Kag96]. [BEM95, Pea95, Smi94]. Low-Cost Fits [Ano97a]. Floating [SGJ+99]. [BEM95, Pea95]. Floating-Point [SGJ+99]. Fluid [Hun00]. FORTRAN [Hun00]. FORTRAN-based Manual [Adv00a, Mot98, Mot99a, Sun00]. [Hun00]. four [TO96]. four-issue [TO96]. matrix [AB01]. matrix-matrix [AB01]. 3

MAX [Lee96]. MAX-2 [Lee96]. Media [Hun00]. PowerPC [DDHS00, Tho98]. [DDC+98, DDHS00, Sun96b, TONH96, praise [Ano96b]. Presents [GL97]. Price Lee96]. media-processing [TONH96]. [Smi94]. privacy [Ano96b]. Processing Memorial [IEE98]. methodologies [DDC+98, DDHS00, Sun96b, TONH96, [BBJ+95]. Microarchitecture JC98, Lee96]. Processor [Adv00b, Fit01, [Kag96, KGOL97]. Mah96, Mod97, BM00, TO96]. Processors [BKQW95, OWJF98, Phi98, BBJ+95, BT97]. [KGOL97, LBLS95]. Products [Ano94]. [KGOL97]. minimal Programming [KB96, Mot98, Mot99b, [Lee96]. MMX [AJ97, Ano96a, Ano96b, Mot99a, Ano96a, Ano96b]. Ano97a, Ano97b, Ano97c, Ano97d, Ano98, Bli97, Dro00, Fom99, GL97, Hal97, Kag96, Quality [AJ97]. KGOL97, KB96, MPW97, Mod97, PW96, PWW97, Pou98, Ram99, SB01, Wei96]. readers [Ano96a, Ano96b]. Real Mobile [JC98]. Model [Smi94]. money [Fom00, LBLS95]. Real-Time [Ano96b]. Motion [RRM96]. Motorola [Fom00, LBLS95]. Reciprocal [IM99]. [Ful98]. MPEG [LBLS95]. Multimedia record [IEE96, IEE98]. Reference [Sun00]. [Ano95, LHxx, Mah96, Pea95, PWW97, RISC [LHxx, Pea95, TONH96]. Root SGJ+99, LBLS95, PW96, TO96]. [IM99]. Routines [GV06]. multimedia-enhanced [LBLS95]. multiply [AB01]. Multiuser [BEM95]. Scientific [Hun00]. Second [Phi98]. Series [Smi94]. Server [BEM95, Ano96a]. Set Networking [Pea95, Sun96a]. New-Media [Mah96, Sun96a, Sun02, Lee96, Sun96b]. sets [Sun96b]. News [GL97]. Notes [GL97]. NT [TONH96]. Sheet [Adv00b]. significantly [Ano96a]. [TONH96]. SIMD [TH99, Phi98]. Simulation [Hun00]. Sixth [Mah96]. off [Ano96b, Wei96]. on-chip [TO96]. Sixth-Generation [Mah96]. Smart [GL97]. operation [BM00]. Optimization [Fom99]. Software [AS96, Hsu01, LBLS95]. sound Optimizing [Fom00]. Order [SGJ+99]. [Ano96b]. souped [Ano97c]. souped-up Out-of-Order [SGJ+99]. Overview [Ano97c]. SPARC [Sun00]. Special [Ano95, MPW97, Ram99]. [Ano97d]. Spectrum [DDC+98]. Speed [GV06]. Speeds [TONH96]. Square [IM99]. P55C [Kag96]. PA SSE [AB01, SB01]. Stanford [Ano95, BBJ+95, BKQW95, BT97, LBLS95, [IEE96, IEE97, IEE98]. Starts [Ano97a]. LHxx, Pea95, Smi94]. PA-7100LC [Smi94]. strategies [BM00, KB96]. Streaming PA-RISC [LHxx, Pea95]. packaging [TH99]. Studio [GV06]. Study [Smi94]. packing [BM00]. Palo [IEE98]. [BKQW95, SB01]. Subword [JC98, Lee96]. Parallelism [BGGT01, Lee96]. PC [AS96]. Sun [GV06]. Superscalar [SGJ+99, Smi94]. PCs [Ano96a, Ano97c, PWW97]. Pentium Support [Sun96b]. supporting [Ano97a, Ano97c, KGOL97, Smi94]. [Lee96, TO96]. symposium [IEE96]. PentiumAE [Mod97]. Performance System [BEM95, BT97, Pou98]. Systems [AS96, BEM95, Mod97, Pea95, SB01, Wei96, [Hun00, Ano97c, JC98]. BM00, PW96, Smi94, TO96]. Phoning [GL97]. Point [SGJ+99]. Power Technology [Adv00a, AJ97, Ano97d, [Ano97c, Smi94, BM00, Hal97]. PowerMac DDC+98, Fom99, Ful98, GL97, Kag96, REFERENCES 4

KGOL97, MPW97, Mod97, Mot98, Mot99a, ISSN 1532-0626 (print), 1532- Mot99c, OWJF98, OFW99, PW96, Phi98, 0634 (electronic). URL http:/ Ram99, Wei96, Ano97c]. test [Ano97a]. /www3.interscience.wiley.com/ Those [Ano97b]. Three [SGJ+99]. cgi-bin/abstract/77004416/START; Three-Way [SGJ+99]. Time http://www3.interscience.wiley. [Fom00, LBLS95]. times [PW96]. Tour com/cgi-bin/fulltext?ID=77004416& [Mot99c]. Trade [Wei96]. Trade-off [Wei96]. PLACEBO=IE.pdf. Transitioning [GL97]. Translating [Dro00]. Tuning [AS96]. TV [GL97]. AMD:2000:TM

UltraSPARC [Sun96a, Sun96b, TO96]. [Adv00a] , Inc., Unit [SGJ+99]. University One AMD Place, P.O. Box [IEE96, IEE97, IEE98]. Unix [Smi94]. 3453, Sunnyvale, California, USA. Upgrade [GL97]. uses [Pou98, Smi94]. 3DNow! Technology Manual, March Using [GV06, Sun96a, AJ97, AB01]. 2000. x + 62 pp. URL Utilizing [BN99]. http://www.amd.com/products/ epd/processors/6.32bitproc/8. Value [BM00]. Value-based [BM00]. amdk6fami/26.amdk62e/21928/21928. Vector [Tho98]. Version [Sun02]. Video pdf. Order number 21928G/0. [RRM96, JC98, LBLS95]. VIII [IEE96]. AMD:2000:AKE VIS [GV06, Sun96a, Sun96b, Sun02, TONH96]. [Adv00b] Advanced Micro Devices, Inc., Visual [Sun96b]. Voices [GL97]. Voltage One AMD Place, P.O. Box [Ben95]. Vulnerable [GL97]. 3453, Sunnyvale, California, USA. AMD-K62-E+ Embedded Pro- + Want [GL97]. Way [SGJ 99]. Web [GL97]. cessor Data Sheet, September White [Ben95]. will [Hal97]. Windows 2000. xxii + 346 pp. URL [Ano96a]. Workbench [Pou98]. http://www.amd.com/products/ Workstation [Pea95, Smi94]. write epd/processors/6.32bitproc/8. [Ano96a]. amdk6fami/28.amdk62e/23542/23542a. pdf. Order number 23542A/0. [Mah96]. Abel:1997:IHQ

References [AJ97] James C. Abel and Michael A. Aberdeen:2001:EFM Julier. Implementation of a high quality Dolby digital decoder us- [AB01] Douglas Aberdeen and Jonathan ing MMX technology. Intel Tech- Baxter. Emmerald: a fast matrix- nology Journal, (Q3):11, 1997. matrix multiply using Intel’s SSE URL http://developer.intel. instructions. Concurrency and com/technology/itj/q31997/articles/ Computation: Practice and Ex- art_3.htm; http://developer. perience, 13(2):103–119, Febru- intel.com/technology/itj/q31997/ ary 2001. CODEN CCPEBO. pdf/decoder.pdf. REFERENCES 5

Anonymous:1994:NPd up with Intel’s MMX. Byte Mag- azine, 22(2):26–??, February 1997. [Ano94] Anonymous. New Products. Data- CODEN BYTEDJ. ISSN 0360-5280 mation, 40(4):85–??, February 15, (print), 1082-7838 (electronic). 1994. CODEN DTMNAT. ISSN 0011-6963. Anonymous:1997:JHF Anonymous:1995:OIP [Ano97b] Anonymous. Just how fast are those new MMX chips? Byte Magazine, [Ano95] Anonymous. Overview of the imple- 22(2):26–??, February 1997. CO- mentation of the PA 7100LC multi- DEN BYTEDJ. ISSN 0360-5280 media enhancements. Hewlett-Pack- (print), 1082-7838 (electronic). ard Journal: technical information from the laboratories of Hewlett- Anonymous:1997:MPD Packard Company, 46(2):66, April 1995. CODEN HPJOAX. ISSN [Ano97c] Anonymous. MMX power for desk- 0018-1153. URL http://www.hp. top PCs: We look at 10 Pentium com/hpj/95apr/apr95_66.pdf; systems souped-up with MMX tech- http://www.hp.com/hpj/toc-04- nology. Byte Magazine, 22(7):106– 95.html. ??, July 1997. CODEN BYTEDJ. ISSN 0360-5280 (print), 1082-7838 Anonymous:1996:BRW (electronic).

[Ano96a] Anonymous. BYTE readers write Anonymous:1997:SIM about new server architectures, Windows NT issues, MMX pro- [Ano97d] Anonymous. Special issue on MMX gramming, and “free PCs”. Byte technology. Intel Technology Jour- Magazine, 21(10):19–??, October nal, (Q3), 1997. URL http:// 1996. CODEN BYTEDJ. ISSN developer.intel.com/technology/ 0360-5280 (print), 1082-7838 (elec- itj/q31997.htm. tronic). Anonymous:1998:MA Anonymous:1996:IBRa [Ano98] Anonymous. MMX alternatives. [Ano96b] Anonymous. INBOX — BYTE Byte Magazine, 23(7):26–??, July readers sound off on electric money, 1998. CODEN BYTEDJ. ISSN Internet commerce and privacy, pro- 0360-5280 (print), 1082-7838 (elec- gramming for Intel’s MMX, and tronic). praise the core of BYTE. Byte Mag- azine, 21(9):19–??, September 1996. Atkins:1996:PSP CODEN BYTEDJ. ISSN 0360-5280 [AS96] Mark Atkins and Ramesh Subra- (print), 1082-7838 (electronic). maniam. PC software performance Anonymous:1997:BFS tuning. Computer, 29(8):47–54, Au- gust 1996. CODEN CPTRB4. ISSN [Ano97a] Anonymous. Better in fits and starts 0018-9162 (print), 1558-0814 (elec- — we test a new Pentium hopped tronic). REFERENCES 6

Bass:1995:DMP Bik:2001:AEI

[BBJ+95] Mick Bass, Terry W. Blanchard, [BGGT01] Aart J. C. Bik, Milind Girkar, D. Douglas Josephson, Duncan Paul M. Grey, and Xinmin Tian. Weir, and Daniel L. Halperin. De- Automatically exploiting implicit sign methodologies for the PA parallelism. Dr. Dobb’s Jour- 7100LC microprocessor. Hewlett- nal of Software Tools, 26(7):28, Packard Journal: technical in- 30, 32–34, July 2001. CO- formation from the laboratories DEN DDJOEB. ISSN 1044- of Hewlett-Packard Company,46 789X. URL http://www.ddj.com/ (2):23–35, April 1995. CO- ftp/2001/2001_07/parallel.txt. DEN HPJOAX. ISSN 0018-1153. Bass:1995:PMC URL http://www.hp.com/hpj/ 95apr/apr95_23.pdf; http:// [BKQW95] Mick Bass, Patrick Knebel, www.hp.com/hpj/toc-04-95.html. David W. Quint, and William L. Walker. The PA 7100LC micro- Bowers:1995:DLC processor: A case study of IC de- sign decisions in a competitive en- [BEM95] Dennis A. Bowers, Gerard M. vironment. Hewlett-Packard Jour- Enkerlin, and Karen L. Murillo. nal: technical information from Development of a low-cost, high- the laboratories of Hewlett-Pack- performance, multiuser business ard Company, 46(2):12–22, April server system. Hewlett-Packard 1995. CODEN HPJOAX. ISSN Journal: technical information from 0018-1153. URL http://www.hp. the laboratories of Hewlett-Pack- com/hpj/95apr/apr95_12.pdf; ard Company, 46(2):79–84, April http://www.hp.com/hpj/toc-04- 1995. CODEN HPJOAX. ISSN 95.html. 0018-1153. URL http://www.hp. com/hpj/95apr/apr95_79.pdf; Blinn:1997:JBC http://www.hp.com/hpj/toc-04- [Bli97] James F. Blinn. Jim Blinn’s corner: 95.html. Fugue for MMX. IEEE Computer Benzel:1995:BBW Graphics and Applications, 17(2): 88–93, March/April 1997. CODEN [Ben95] Jack D. Benzel. Bugs in black and ICGADZ. ISSN 0272-1716 (print), white: Imaging IC logic levels with 1558-1756 (electronic). Makes sev- voltage contrast. Hewlett-Packard eral cogent comments about defi- Journal: technical information from ciencies in the Intel MMX pixel- the laboratories of Hewlett-Pack- processing instruction set [PWW97] ard Company, 46(2):102–106, April for use in image compositing. 1995. CODEN HPJOAX. ISSN Brooks:2000:VBC 0018-1153. URL http://www.hp. com/hpj/95apr/apr95_1b.pdf; [BM00] David Brooks and Margaret Martonosi. http://www.hp.com/hpj/toc-04- Value-based clock gating and op- 95.html. eration packing: dynamic strate- REFERENCES 7

gies for improving processor power Hunter Scales. AltiVec extension and performance. ACM Transac- to PowerPC accelerates media pro- tions on Computer Systems, 18(2): cessing. IEEE Micro, 20(2):85– 89–126, May 2000. CODEN AC- 95, March/April 2000. CODEN SYEC. ISSN 0734-2071 (print), IEMIDZ. ISSN 0272-1732 (print), 1557-7333 (electronic). URL http: 1937-4143 (electronic). URL http: //www.acm.org/pubs/citations/ //dlib.computer.org/mi/books/ journals/tocs/2000-18-2/p89- mi2000/pdf/m2085.pdf. brooks/. Drongowski:2000:TM Bush:1999:CEE [Dro00] Paul J. Drongowski. Translat- [BN99] Jonathan Bush and Timothy S. ing MMX. Computer, 33(3):43, Newman. The cutting edge: Ef- March 2000. CODEN CPTRB4. fectively utilizing 3DNow! in ISSN 0018-9162 (print), 1558-0814 Linux. Linux Journal, 68:??, De- (electronic). URL http://dlib. cember 1999. CODEN LIJOFX. computer.org/co/books/co2000/ ISSN 1075-3583 (print), 1938- pdf/r3040.pdf. 3827 (electronic). URL http:// Fite:2001:FLA noframes.linuxjournal.com/lj- issues/issue68/3685.html. [Fit01] Matthew Fite. First look at an Blanchard:1997:PMH Apple G4 with the AltiVec proces- sor. Linux Journal, 86:108, 110–111, [BT97] T. W. Blanchard and P. G. To- 114, 116, 118, June 2001. CODEN bin. The PA 7300LC micropro- LIJOFX. ISSN 1075-3583 (print), cessor: a highly integrated sys- 1938-3827 (electronic). URL http: tem on a chip. Hewlett-Packard //noframes.linuxjournal.com/ Journal: technical information from lj-issues/issue86/4584.html. the laboratories of Hewlett-Packard Company, 48(3):43–47, June 1997. Fomitchev:1999:MTC CODEN HPJOAX. ISSN 0018- [Fom99] Max I. Fomitchev. MMX tech- 1153. URL http://www.hp.com/ nology code optimization. Dr. hpj/97jun/ju97a6.htm. Dobb’s Journal of Software Tools, Diefendorff:1998:ATA 24(9):38, 40, 42–46, 48, September 1999. CODEN DDJOEB. ISSN [DDC+98] K. Diefendorff, P. Dubey, R. Chochsprung, 1044-789X. URL http://www.ddj. et al. AltiVec technology: Accel- com/ftp/1999/1999_09/.txt; erating media processing across the http://www.ddj.com/ftp/1999/ spectrum. In IEEE [IEE98], page ?? 1999_09/mmx.zip. ISBN ???? LCCN ???? Fomitchev:2000:ORT Diefendorff:2000:AEP [Fom00] Max I. Fomitchev. Optimizing [DDHS00] Keith Diefendorff, Pradeep K. 3Dnow! real-time graphics. Dr. Dubey, Ron Hochsprung, and Dobb’s Journal of Software Tools, REFERENCES 8

25(8):40, 42–46, August 2000. CO- Halfhill:1997:BMN DEN DDJOEB. ISSN 1044- 789X. URL http://www.ddj.com/ [Hal97] Tom R. Halfhill. Beyond MMX — ftp/2000/2000_08/3dnow.txt; new 3-D power of AMD, Centaur, http://www.ddj.com/ftp/2000/ and Cyrix chips will challenge In- 2000_08/3dnow.zip. tel. Byte Magazine, 22(12):87–??, December 1997. CODEN BYTEDJ. Fuller:1998:MAT ISSN 0360-5280 (print), 1082-7838 [Ful98] Sam Fuller. Motorola’s Al- (electronic). tiVec technology. Technical Hsu:2001:CAS Report ALTIVECWP/D, Mo- torola Corporation, Phoenix, AZ, [Hsu01] John Y. Hsu. Computer Archi- USA, 1998. 4 pp. URL tecture: Software Aspects, Coding, http://a1888.g.akamai.net/7/ Hardware. CRC Press, 2000 N.W. 1888/787/83ade987b85512/www. Corporate Blvd., Boca Raton, FL motorola.com/SPS/PowerPC/teksupport/ 33431-9868, USA, 2001. ISBN 0- teklibrary/papers/altivec_wp. 8493-1026-1, 1-351-83604-8, 1-4200- pdf. 4110-X (e-book). 427 pp. LCCN Grosky:1997:NSV A76.9.A73 H758 2001. US$89.95, UK£59.99. [GL97] William Grosky and Anne C. Lear. In the news: Smart but vulnerable; Hunter:2000:EPG transitioning to digital TV; gaining voices; business briefs; technology [Hun00] Craig A. Hunter. An evalu- notes; phoning the Web; converging ation of PowerMac G4 systems competition; Intel presents MMX for FORTRAN-based scientific com- — and another upgrade dilemma; puting with application to com- want more information? IEEE putational fluid dynamics simula- MultiMedia, 4(1):12–16, January– tion. Technical report, NASA March 1997. CODEN IEMUE4. Langley Research Center, Con- ISSN 1070-986X (print), 1941-0166 figuration Aerodynamics Branch, (electronic). URL http://dlib. Hampton, VA, USA, July 2000. URL computer.org/mu/books/mu1997/ http://ad-www.larc.nasa. pdf/u1012.pdf. gov/~cah/NASA_G4_Study.pdf. Gove:2006:SSU IEEE:1996:HCV

[GV06] Darryl Gove and Geetha Vallabha- [IEE96] IEEE, editor. Hot chips VIII: sym- nen. Sun Studio: Using VIS in- posium record: Stanford Univer- structions to speed up key routines. sity, Stanford, California, August Sun Developer Network, January 5, 18–20, 1996. IEEE Computer Soci- 2006. URL http://developers. ety Press, 1109 Spring Street, Suite sun.com/prodtech/cc/articles/ 300, Silver Spring, MD 20910, USA, vis.html. 1996. ISBN ???? LCCN ???? REFERENCES 9

IEEE:1997:HCI tion of MMX technology. In IEEE [IEE96], pages 157–162. ISBN ???? [IEE97] IEEE, editor. Hot Chips IX: Stan- LCCN ???? ford University, Stanford, Califor- nia, August 24–26, 1997. IEEE Khazam:1996:PSI Computer Society Press, 1109 [KB96] Jonathan Khazam and Bev Bach- Spring Street, Suite 300, Silver mayer. Programming strategies for Spring, MD 20910, USA, 1997. Intel’s MMX. Byte Magazine, 21(8): ISBN ???? LCCN ???? 63–64, August 1996. CODEN BYT- IEEE:1998:HCC EDJ. ISSN 0360-5280 (print), 1082- 7838 (electronic). [IEE98] IEEE, editor. Hot chips 10: conference record: August 16–18, Kagan:1997:MMP 1998, Memorial Auditorium, Stan- ford University, Palo Alto, Califor- [KGOL97] Michael Kagan, Simcha Gochman, nia. IEEE Computer Society Press, Doron Orenstien, and Derrick Lin. 1109 Spring Street, Suite 300, Sil- MMX microarchitecture of Pentium ver Spring, MD 20910, USA, 1998. processors with MMX technology ISBN ???? LCCN ???? and Pentium II microprocessors. Intel Technology Journal, (Q3):8, Iordache:1999:ARS 1997. URL http://developer. [IM99] Cristina Iordache and David W. intel.com/technology/itj/q31997/ Matula. Analysis of reciprocal and articles/art_4.htm; http:// square root reciprocal instructions developer.intel.com/technology/ in the AMD K6-2 implementation of itj/q31997/pdf/micro.pdf. 3DNow! Electronic Notes in Theo- Lee:1995:RTS retical Computer Science, 24:34–62, 1999. CODEN ???? ISSN 1571- [LBLS95] Ruby B. Lee, John P. Beck, Joel 0661. Lamb, and Kenneth E. Severson. Jennings:1998:MCS Real-time software MPEG video de- coder on multimedia-enhanced PA [JC98] Matthew D. Jennings and Thomas M. 7100LC processors. Hewlett-Pack- Conte. Mobile computing: Subword ard Journal: technical informa- extensions for video processing on tion from the laboratories of Hew- mobile systems. IEEE Concurrency, lett-Packard Company, 46(2):60–68, 6(3):13–16, July/September 1998. April 1995. CODEN HPJOAX. CODEN IECMFX. ISSN 1092- ISSN 0018-1153. URL http: 3063 (print), 1558-0849 (electronic). //www.hp.com/hpj/95apr/apr95_ URL http://dlib.computer.org/ 60.pdf; http://www.hp.com/ pd/books/pd1998/pdf/p3013.pdf. hpj/toc-04-95.html. Kagan:1996:PMF Lee:1996:SPM [Kag96] Michael Kagan. The P55C microar- [Lee96] Ruby B. Lee. Subword parallelism chitecture — the first implementa- with MAX-2: Accelerating media REFERENCES 10

processing with a minimal set of in- gramming Interface Manual, June struction extensions supporting ef- 1999. 262 pp. Order number AL- ficient subword parallelism. IEEE TIVECPIM/D 6/1999 Rev. 0. Micro, 16(4):51–59, July/August 1996. CODEN IEMIDZ. ISSN 0272- Motorola:1999:APE 1732 (print), 1937-4143 (electronic). [Mot99b] Motorola, Inc. AltiVec program- Lee:19xx:BME ming examples. World-Wide Web document., December 9, 1999. [LHxx] Ruby Lee and Jerry Huck. 64-bit URL http://www.motorola.com/ and multimedia extensions in the SPS/PowerPC/AltiVec/CodeMain. PA-RISC 2.0 architecture. Techni- html. cal report, Hewlett–Packard Com- Motorola:1999:ATT pany, 19410 Homestead Road, Cupertino, CA 95014, USA, [Mot99c] Motorola, Inc. AltiVec technology 19xx. URL http://www.hp.com/ tour. World-Wide Web document., computing/framed/technology/ December 9, 1999. URL http:// micropro/architecture/docs/pa2go3. www.motorola.com/SPS/PowerPC/ html. AltiVec/technology.html. Maher:1996:MIS Mittal:1997:MTA [Mah96] Robert Maher. Multimedia in- struction set extensions for a sixth- [MPW97] Millind Mittal, Alex Peleg, and generation x86 processor. In IEEE Uri Weiser. MMX technology ar- [IEE96], pages 163–170. ISBN ???? chitecture overview. Intel Tech- LCCN ???? nology Journal, (Q3):12, 1997. URL http://developer.intel. Modi:1997:PIC com/technology/itj/q31997/articles/ [Mod97] Nimish Modi. The PentiumAE art_2.htm; http://developer. II CPU: A high performance dy- intel.com/technology/itj/q31997/ namic execution processor with pdf/archite.pdf. MMX technology. In IEEE [IEE97], Oberman:1999:ATA page ?? ISBN ???? LCCN ???? [OFW99] Stuart Oberman, Greg Favor, and Motorola:1998:ATP Fred Weber. AMD 3DNow! tech- [Mot98] Motorola Corporation, Phoenix, nology: Architecture and imple- AZ, USA. AltiVec Technology Pro- mentations. IEEE Micro, 19(2): gramming Environments Manual, 37–48, March/April 1999. CO- November 1998. 350 pp. Order DEN IEMIDZ. ISSN 0272- number ALTIVECPEM/D 11/1998 1732 (print), 1937-4143 (elec- Rev. 0.1. tronic). URL http://dlib. Motorola:1999:ATP computer.org/mi/books/mi1999/ pdf/m2037.pdf; http://www. [Mot99a] Motorola Corporation, Phoenix, computer.org/micro/mi1999/m2037abs. AZ, USA. AltiVec Technology Pro- htm. REFERENCES 11

Oberman:1998:ATK architecture — improving multime- dia and communications applica- [OWJF98] Stuart Oberman, Fred Weber, tion performance by 1.5 to 2 times. Norbert Juffa, and Greg Favor. IEEE Micro, 16(4):42–50, July/ AMD 3DNow! technology and August 1996. CODEN IEMIDZ. the K6-2 microprocessor. In IEEE ISSN 0272-1732 (print), 1937-4143 [IEE98], pages 245–254. ISBN ???? (electronic). LCCN ???? Peleg:1997:IMM Pearson:1995:LCH [PWW97] Alex Peleg, Sam Wilkie, and Uri [Pea95] Roger A. Pearson. A low-cost, Weiser. Intel MMX for multime- high-performance PA-RISC work- dia PCs. Communications of the station with built-in graphics, mul- ACM, 40(1):24–38, January 1997. timedia, and networking capa- CODEN CACMA2. ISSN 0001- bilities. Hewlett-Packard Jour- 0782 (print), 1557-7317 (electronic). nal: technical information from URL http://www.acm.org/pubs/ the laboratories of Hewlett-Pack- citations/journals/cacm/1997- ard Company, 46(2):6–11, April 40-1/p25-peleg/. See also Blinn’s 1995. CODEN HPJOAX. ISSN comments [Bli97] about MMX in- 0018-1153. URL http://www.hp. struction set deficiencies. com/hpj/95apr/apr95_6t.pdf; Ramirez:1999:OIM http://www.hp.com/hpj/toc-04- 95.html. [Ram99] Ariel Ortiz Ramirez. An overview Phillip:1998:ATS of Intel’s MMX technology. Linux Journal, 61:??, May 1999. CODEN [Phi98] Mike Phillip. AltiVec technology: LIJOFX. ISSN 1075-3583 (print), A second generation SIMD micro- 1938-3827 (electronic). processor architecture. In IEEE [IEE98], page ?? ISBN ???? LCCN Rubinfeld:1996:MVI ???? [RRM96] Paul Rubinfeld, Bob Rose, and Pournelle:1998:WJB Michael McCallig. Motion video instruction extensions for alpha. [Pou98] Jerry Pournelle. From the work- Technical report, Semiconduc- bench — Jerry builds a new sys- tor Engineering Group, 77 Reed tem that uses AMD’s K6 MMX en- Road, HLO2-3/D11 Hudson, MA hanced chip. Byte Magazine,23 01749, USA, October 18, 1996. (1):123–??, January 1998. CODEN URL http://www.digital.com/ BYTEDJ. ISSN 0360-5280 (print), semiconductor/alpha/papers/pmvi. 1082-7838 (electronic). pdf. Peleg:1996:MTE Strey:2001:PAI

[PW96] Alex Peleg and Uri Weiser. MMX [SB01] Alfred Strey and Martin Bange. technology extension to the Intel Performance analysis of Intel’s REFERENCES 12

MMX and SSE: A case study. struction set. Technical Re- Lecture Notes in Computer Sci- port WPR-0013, Sun Microsys- ence, 2150:142–??, 2001. CO- tems, 901 San Antonio Road, Palo DEN LNCSD9. ISSN 0302- Alto, CA 94303-4900, USA, August 9743 (print), 1611-3349 (electronic). 1996. URL http://www.sun.com/ URL http://link.springer-ny. microelectronics/whitepapers/ com/link/service/series/0558/ wpr-0013/index.html. Demon- bibs/2150/21500142.htm; http: strates speedups of 2x to 5x on key //link.springer-ny.com/link/ networking kernels from use of the service/series/0558/papers/2150/ VIS instruction set. 21500142.pdf. Sun:1996:UVI Scherer:1999:OTW [Sun96b] . UltraSPARC: The Visual Instruction Set (VIS): [SGJ+99] Alisa Scherer, Michael Golden, Nor- On chip support for new-media pro- bert Juffa, Stephan Meier, Stu- cessing. Technical Report WPR- art Oberman, Hamid Partovi, and 0004, Sun Microsystems, 901 San Fred Weber. An out-of-order Antonio Road, Palo Alto, CA three-way superscalar multimedia 94303-4900, USA, 1996. URL http: floating-point unit. In 1999 IEEE //www.sun.com/microelectronics/ International Solid-State Circuits whitepapers/wpr-0004; http: Conference, page ?? IEEE Com- //www.sun.com/microelectronics/ puter Society Press, 1109 Spring whitepapers/wpr-0004/wpr-0004. Street, Suite 300, Silver Spring, MD pdf. 20910, USA, 1999. ISBN 0-7803- 5129-0. LCCN ???? Sun:2000:SAL Smith:1994:PWA [Sun00] Sun Microsystems, 901 San Antonio Road, Palo Alto, CA 94303-4900, [Smi94] Ben Smith. Power workstation at a USA. SPARC Assembly Language Pentium price: The HP 9000 Series Reference Manual, February 2000. 700 Model 712/60 uses innovative URL ftp://192.18.99.138/806- hardware packaging and the super- 3774/806-3774.pdf. Part Number scalar PA-7100LC CPU to achieve 806-3774. See Appendix E.6 for Ul- breakthroughs in low cost and high traSPARC and VIS Instruction Set performance for a Unix workstation. Extensions. Byte Magazine, 19(7):161–??, July 1994. CODEN BYTEDJ. ISSN Sun:2002:VIS 0360-5280 (print), 1082-7838 (elec- [Sun02] Sun Microsystems. The VIS instruc- tronic). tion set, version 1.0. White paper, Sun:1996:ACN Sun Microsystems, Network Circle Santa Clara, CA 95054, USA, June [Sun96a] Sun Microsystems. Accelerat- 2002. URL http://www.sun.com/ ing core networking functions us- processors/vis/download/vis/ ing the UltraSPARC VIS in- vis_whitepaper.pdf; http:// REFERENCES 13

www.sun.com/processors/whitepapers/ cro, 16(4):10–20, July/August 1996. vis_wp_external.pdf. CODEN IEMIDZ. ISSN 0272-1732 (print), 1937-4143 (electronic). Thakkar:1999:ISS Weiser:1996:TCP [TH99] Shreekant (Ticky) Thakkar and Tom Huff. The Internet Streaming [Wei96] Uri Weiser. Trade-off considerations SIMD Extensions. Intel Technol- and performance of Intel’s MMX ogy Journal, (Q2):8, May 17, 1999. technology. In IEEE [IEE96], pages URL http://developer.intel. 147–156. ISBN ???? LCCN ???? com/technology/itj/q21999/articles/ art_1.htm; http://developer. intel.com/technology/itj/q21999/ pdf/simd_ext.pdf. Thompson:1998:CPD [Tho98] Tom Thompson. CPUs: PowerPC does blazing vector. Byte Magazine, 23(7):43–??, July 1998. CODEN BYTEDJ. ISSN 0360-5280 (print), 1082-7838 (electronic). Tremblay:1996:UFI [TO96] Marc Tremblay and J. Michael O’Connor. UltraSparc I: A four- issue processor supporting multi- media: Combining on-chip mul- timedia instructions with a high- performance, four-issue architec- ture. IEEE Micro, 16(2):42– 50, March/April 1996. CODEN IEMIDZ. ISSN 0272-1732 (print), 1937-4143 (electronic). Presented at Hot Chips VII, Stanford University, Stanford, California, August 1995. Tremblay:1996:VSN [TONH96] Marc Tremblay, J. Michael O’Connor, Venkatesh Narayanan, and Liang He. VIS speeds new me- dia processing — enhancing con- ventional RISC instruction sets to significantly accelerate media- processing algorithms. IEEE Mi-