, C++, & for the AMD and EM64T Processors • • • K K ered abugandchallengetoourdevelopmentteam. suiteisderivedfromtheSGI thePathScale FORTRANNoted foritsmature and excellent Optimizer, The QLogicPathScale ernized thiscompilerwithOpenMP2.5 andagnuC/C++front-end, improve performance. techniquesto framework purposebuiltforinsertingnewoptimization The PathScale compilerincludes aworld-class Optimization” Known “Every the Intel ofboththe formance 64-bitfeatures Advanced MicroDevices AMD64 and This advancedcompilersuitetakesadvanta ance 64-bitC, enhanced thecodegenera 64-bit Linuxcompilerwithcomplete GNU toolchaincompa is theworld’s highestperformance Theresultacrossalllanguages, sors. In real-worlda 40% performance gain overalterna D M E E E compiled objects Mix andmatchGNUPathScale just work. Uses standardflagssomostmakefiles modification Faster portswithlesssource code A Y Y R X

I I F F V M High Performance Compilers E E E U D A A ® M

T T F Xeon U U

R P R R O E the E E M R PathScale CompilerSuite S S ™ F C++ andF pplica

FIBRE CHANNELSWITCHES P

O EM64T architectures. R R O M V A tion testing, E N ™ At P N C

F Compiler Suiterepresentsthehighest-perform- E ortran compilersforLinux-basedenvironments. O

a tor tosupport AMD64 andIntelEM64Tproces- A U thScale, poor compiler optimization isconsid- poorcompiler optimization thScale, N N D D

A C T O HPC end-users have experiencedupto HPC end-usershave I O M tive compilerproducts. N P S A ® T Ca®oplr PathScale mod- /®compiler. I B I L I T ge oftheuniquehigh-per- Y • • • • pointers Cray/SGI Fortran95extensionsandcharacter and EnhancedLibraries FORTRAN 77/90/97withPopularExtensions, C++/FORTRAN supportandgdbsyntax Pathdb compatibledebuggerwithenhanced Uses standardGNUtoolchain PathScale This makesthe tibility • • • • • • • Enhanced debuggerforC++andFOR Compatible withGNU/gcctoolchain OpenMP 2.5C/C++/FORTRAN Code generationforAMD64andEM64T 32-bit and64-bitcompilation leadingoptimizations Industry C, C++,andFortranCompilers . and a pplication developmentfocused pplication bugs, and provides excellent assistance in application porting andtuning. andprovidesexcellent assistanceinapplication bugs, performanceproblems as QLogictreats engineering talentintheindustry. QLogic providesadvancedtechnicalsupport bysomeofthebestcompiler formance improvements, per- bugfixes, which includes allmajorandminorproductenhancements, The QLogicP bly codecanbeissued. inline AMD64 assem- Inaddition, simply performbetteronthisprocessor. 64-bitcodewill support, cient parameterpassingandSSE2/3DNow!™ moreeffi- largeregistersets, likecomplexaddressingmodes, advantages ca The QLogicPathScale CompilerSuite maximizes theperformanceofappli- c QLogic PathScale CompilerSuitetheclear choicefor64-bitcomputing available tion issoldonaperdeveloperbasis. on maximumperformance. W F O The clear choicefor64-bit lusters focusedonmaximumperformance. L P O incd hncmie o M peo ytm.With AMD Opteron tion codewhencompiledfor AMD Opteronsystems. E T R X I M L I B D I L Z

E C E

D L P A

TM U F S R O S a C R

thScale CompilerSuiteissoldasasubscriptionser S H

A U A M P S Compiler Suite P D I N O

G O R

P T O T P E and accesstocustomersupport. T • • • • • R TRAN I O O Tested withtheIntelMathKernelLibrary AMD CoreMathLibrary Free downloadablePathscaleoptimized OpenMP 2.5capableFortran Big-endian/Little-endian FortranI/O debuggers Works withpopularcommercial N N

S A N D

I N Processors for theAMDandIntelEM64T C T P , C++,&FortranCompilers E erpetual licenseoptionsarealso L

E M 6 4 T

S Y S T E M S The subscrip vice, - C, C++, & Fortran Compilers for the AMD and Intel EM64T Processors users maynotcopyormodifyQLogicproprietar end- ExceptaspermittedintheQLogicPathScale Subscription Agreement, thirdparty. codetoany sublicenseordistributeQLogicproprietary End-usersmaynottransfer, works. areindependentandseparate that whichincludes codesuchasrun-timeperformanceenhancinglibraries code, totheQLogicproprietary Opensourcelicensetermsdonotapply incompliancewithsection(3a)oftheGPL. distribution, with theirbinary a The QLogicP For alistofauthorizedresellers,visit •••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• • • Pathdbcommandlinedebugger • • • GNUC/C++front-endcompatiblewithOpenMP2.5 Functional Components • Peepholeoptimization • Loopunrolling • Globalregisterallocation • Instructionscheduling • If-conversion • Code Generation • IntructionschedulingfortheIntelEM64Tcore • Unalignedloads • • AllSSE3instructionsavailable • • Utilizes full64-bitISAincludingSSE2/3DNow!™ • SupportsAMD64ABI(x86-64) • Longaddresssupport • pplicable opensourcelicense. Tuned librariesincludingACML3.5 Singleprecisioncomplex(exceptdiv/mult) InstructionschedulingfortheAMDout-of-ordercore Fortran95withCray/SGIextensionsandOpenMP2.5 Controlflowoptimization Supportforinlineassemblycode a code base Optimizer andFortranderivedfromSGIOpen64 nd registersets 06Qoi oprto.Alrgt eevd Lgc h LgcLg,Ptsae h ahcl oo KPt,IfnPt,OtPt,and areAccelerating registered Cluster Performance trademarks or OptiPath, Corporation. Othertrademarks are thepropertyoftheir respective owners. InfiniPath, trademarks ofQLogic, EKOPath, thePathScalelogo, Pathscale, the QLogicLogo, © 2006QLogicCorporation. QLogic, All rightsreserved. PATHSCALE COMPILERSUITEFEATURES AMD64 SPECIFICCAPABILITIES EM64T SPECIFICCAPABILITIES a thScale CompilerSuitecontainsonacommondistributionmediumbothQLogicproprietar MAJOR OPTIMIZA Modifica TIONS tions toGPLedsourcearelicensedundertheGPL. 949.389.6000 Aliso 26650 QLogic Corpor Corpor y code. Viejo ate Headquarters Aliso www.qlogic.com/pathscale CA92656 , Viejo Parkway ation • Stepbackwardcapabilit • • • Understands Fortran77and90types,expression • Well formattedprintingofFortranvariables,arrays, Debugger (pathdb)compatibility • • Customlibrariesincluded,includinglibmandACML2.5 • • All commonFortran77extensionsforcompiling • OpenMP2.5support • Source compatibleFortran77/90/95withCray/ Fortran compatibility • MixandmatchGNUPathScalecompiledobjects • 100%binar • Source compatibleatGNU3.3.XwithOpenMP2.5 • • LoopFission • CacheBlocking • LoopInterchange • Loop-causeddependencyanalysis • Vectorization (includingSIMD) • Padarraystoreducecacheconflicts • Gather-scatter • ScalarExpansionandArray • Prefetching • Outerloopunrolling C/C++ Compatibility Big-endian/Little-endianFortranI/O Linkswithg77compiledlibraries LoopFusion compiled code Gdb compatiblecommandsandworkswithgnu lists anddynamictypes Properly printsstandardC++templates,maps,vectors, operators expressions “dusty deck”code extensions includingfullpointersupport BINARY ANDSOURCECODECOMPATIBILITY 64 bit y interoperability, includingnamemangling LOOP NESTOPTIMIZER End-users canreceivesourcecodeforalloftheopencomponentsQLogicP PathScale CompilerSuite TECHNICAL SPECIFICATIONS +440(0)1483 295825 Surrey GU27YGUK 40 OccamRoadGuildford Surrey TechnologyCentre QLogic (UK)LTD. Europe Headquarters y andopen-sourcesoftwarecomponents. WWW All opensourcesoftwareislicensedinfullcompliancewiththe • Automaticcommonblockpadding • Deadfunctionanddeadvariableelimination • Inter-procedural constantpropagationofglobals • • Operatesacrossmultiplecompilationruns • Partial redundancyeliminationbasedonSSAPRE Main optimizer • • Inductionvariablecanonicalization • Deadstoreanddeadcodeelimination • Tail recursionelimination • • Gotoconversionandloopnormalization Pre-optimizer • Registerpromotion • Red HatFedoraCore2,3 ™EnterpriseLinuxWorkstation 3and4 SuSE LinuxEnterpriseSer SuSE™ LinuxProfessional9.2and9.3 Aliasanalysis(flow-freeandflow-sensitive) Inductionvariableelimination Copypropagation Functioninlining and parameters framework –Linear functiontestreplacement –Strength reduction –Loop invariantcodemotion – Global commonsub-expression .QLOGIC.COM G TESTED OPERATING SYSTEMS INTER-PROCEDURAL ANALYSIS LOBAL SCALAROPTIMIZATION Processors for theAMDandIntelEM64T C, C++,&FortranCompilers ver 8and9 a hcl oplrSie along thScale CompilerSuite, SN0058046-00 Rev A 5/06