Overclocking Matters For 3D Architectural Cad Design

Total Page:16

File Type:pdf, Size:1020Kb

Overclocking Matters For 3D Architectural Cad Design WHY OVERCLOCKING MATTERS FOR 3D ARCHITECTURAL CAD DESIGN www.boxx.com 877-877-2699 Avoid The Spinning Circle Virtually all CAD software is single threaded or lightly threaded. This means that the vast majority of design workflows will not make use of all available CPU cores. Design tasks Most of you are familiar with the spinning blue circle execute faster with that Windows will mock you with as your workstation seemingly takes a break from its assigned task. It’s the higher processor mascot of poor productivity and an unwelcome sight when you’re just trying to get through your workday. clock speeds. What causes it and what’s the remedy? The Test Machines It’s About CPU Frequency CAD software like Autodesk Revit is what is known as “frequency bound”. The speed at which a design task is executed is restricted by the clock BOXX APEXX S3 CPU: i7 8700k @ 4.8GHz speed of your workstation’s processor. The vast majority of tasks like model RAM: 512 GB creation, selecting objects, exporting views and general interaction with GPU: Quadro P4000 your model in the viewport can be sped up by using a higher frequency HD: 512GB M.2 Windows 10 CPU. OS: This makes the choice of CPU critical when selecting a workstation for CAD design. Our competitors will present high core count, low frequency CPUs as the highest performing workstations for running professional DELL 7920 CPU: Xeon Gold @ 3.6GHz applications.¹ This can be misleading when addressing the compute RAM: 512 GB bottlenecks of architectural CAD tools. As a rule of thumb, the more cores GPU: Quadro P4000 a CPU has, the lower the clock speed (frequency). A CPU with fewer cores HD: 512GB M.2 OS: Windows 10 will generally have a higher clock speed. For optimal CAD design performance, customers should seek out processors with fewer CPU cores. Overclocked Advantage HP Z6 Boosting the frequency of your CPU (overclocking) is one of the best ways CPU: Xeon Gold @ 3.6GHz RAM: 512 GB to extract maximum performance out of your hardware investment and GPU: Quadro P4000 maximum productivity out of your designers. BOXX has been profession- HD: 512GB M.2 ally overclocking CAD workstations for over a decade and we’re the only OS: Windows 10 workstation manufacturer offering safe, reliable overclocking backed by a standard three-year warranty. To demonstrate the difference an overclocked system has over competing Xeon based solutions, we ran the popular Revit RFO benchmark on identical configurations from DELL and HP against our flagship CAD workstation, the APEXX S3. www.boxx.com Across the board, you can clearly see the advantage of BOXX’s overclocked, high frequency solution for 3D CAD design tools like Autodesk Revit. APEXX S3 is optimized for All values are represented in seconds and are a measure of the time to complete each task. Lower scores are better. Model Creation Export Note: The time to complete the benchmarked tasks will increase proportionately with larger Revit models. www.boxx.com Graphics - Standard S3 • Features the latest overclocked Intel® Core™ i7 processor with advanced liquid cooling • Up to two dual slot professional GPUs • Up to 64GB of memory • Compact chassis with quiet Graphics - Standard operation CALL A BOXX AEC PERFORMANCE SPECIALIST TODAY AT 877-877-2699 Intel® Core™ i7 processor ® 1. Based on messaging found here http://www.dell.com/en-us/work/shop/workstations-isv-cer- Intel Inside . Extraordinary tified-dell/sc/workstations/precision-desktops Performance Outside. 2. Claim based on lightly threaded workflows like 3D modeling. If user is rendering (a ® Intel, the Intel Logo, Intel Inside, Intel Core, and Core Inside are multi-threaded task) on the same workstation, higher core count CPUs should be considered. trademarks of Intel Corporation in the U.S. and/or other countries..
Recommended publications
  • Analysis and Optimization of Dynamic Voltage and Frequency Scaling for AVX Workloads Using a Software-Based Reimplementation
    Analysis and Optimization of Dynamic Voltage and Frequency Scaling for AVX Workloads Using a Software-Based Reimplementation Bachelor’s Thesis submitted by cand. inform. Yussuf Khalil to the KIT Department of Informatics Reviewer: Prof. Dr. Frank Bellosa Second Reviewer: Prof. Dr. Wolfgang Karl Advisor: Mathias Gottschlag, M.Sc. May 03 – September 02, 2019 KIT – The Research University in the Helmholtz Association www.kit.edu I hereby declare that the work presented in this thesis is entirely my own and that I did not use any source or auxiliary means other than these referenced. This thesis was carried out in accordance with the Rules for Safeguarding Good Scientic Practice at Karlsruhe Institute of Technology (KIT). Karlsruhe, September 2, 2019 Abstract While using the Advanced Vector Extensions (AVX) on current Intel x86 pro- cessors allows for great performance improvements in programs that can be parallelized by using vectorization, many heterogeneous workloads that use both vector and scalar instructions expose degraded throughput when mak- ing use of AVX2 or AVX-512. This eect is caused by processor frequency reductions that are required to maintain system stability while executing AVX code. Due to the delays incurred by frequency switches, reduced clock speeds are attained for some additional time after the last demanding instruction has retired, causing code in scalar phases directly following AVX phases to be executed at a slower rate than theoretically possible. We present an analysis of the precise frequency switching behavior of an Intel Syklake (Server) CPU when AVX instructions are used. Based on the obtained results, we propose avxfreq, a software reimplementation of the AVX frequency selection mechanism.
    [Show full text]
  • Drmos King of Power-Saving
    Insist on DrMOS King of Power-saving Confidential Eric van Beurden / Feb 2009 /Page v1.0 1 EU MSI King of Power-saving What are the power-saving components and technologies from MSI? 1. DrMOS 2. APS 3. GreenPower design 4. Hi-c CAP Why should I care about power-saving? 1. Better earth (Think about it! You can be a hero saving it !) 2. Save $$ on the electricity bill 3. Cool running boards 4. Better overclocking Confidential Page 2 MSI King of Power-saving Is DrMOS the name of a MSI heatpipe? No! DrMOS is the cool secret below the heatpipe, not the heatpipe itself. Part of the heatpipe covers the PWM where the DrMOS chips are located. (PWM? That is technical stuff, right ? Now you really lost me ) Tell me, should I write DRMOS, Dr. MOS or Doctor Mos? The name comes from Driver MOSFET. There is only one correct way to write it; “DrMOS”. Confidential Page 3 MSI King of Power-saving So DrMOS is a chip below the heatpipe? Yes, DrMOS is the 2nd generation 3-in-1 integrated Driver MOSFET. It combines 3 PWM components in one. (Like triple core…) 1. Driver IC 2. Bottom-MOSFET 3. Top-MOSFET Confidential Page 4 MSI King of Power-saving Is MSI the first to use DrMOS on it’s products? DrMOS is an integrated MOSFET design proposed by Intel in 2004. The first to use a 1st generation Driver Mosfet on a 8-Phase was Asus Blitz Extreme. This 1st generation had some problems and disadvantages. These are all solved in the 2nd generation DrMOS which we use exclusive on MSI products.
    [Show full text]
  • Website Designing, Overclocking
    Computer System Management - Website Designing, Overclocking Amarjeet Singh November 8, 2011 Partially adopted from slides from student projects, SM 2010 and student blogs from SM 2011 Logistics Hypo explanation Those of you who selected a course module after last Thursday and before Sunday – Apologies for I could not assign the course module Bonus deadline for these students is extended till Monday 5 pm (If it applies to you, I will mention it in the email) Final Lab Exam 2 weeks from now - Week of Nov 19 Topics will be given by early next week No class on Monday (Nov 19) – Will send out relevant slides and videos to watch Concerns with Videos/Slides created by the students Mini project Demos: Finish by today Revision System Cloning What is it and Why is it useful? What are different ways of cloning the system? Data Recovery Why is it generally possible to recover data that is soft formatted or mistakenly deleted? Why is it advised not to install any new software if data is to be recovered? Topics For Today Optimizing Systems Performance (including Overclocking) Video by Vaibhav, Shubhankar and Mukul - http://www.youtube.com/watch?v=FEaORH5YP0Y&feature=youtu.be Creating a basic website A method of pushing the basic hardware components beyond the default limits Companies equip their products with such bottle-necks because operating the hardware at higher clock rates can damage or reduce its life span OCing has always been surrounded by many baseless myths. We are going to bust some of those. J Slides from Vinayak, Jatin and Ashrut (2011) The primary benefit is enhanced computer performance without the increased cost A common myth is that CPU OC helps in improving game play.
    [Show full text]
  • A+ Guide to Managing and Maintaining Your PC, 7E
    A+ Guide to Managing and Maintaining Your PC, 7e Chapter 6 Supporting Processors Objectives • Learn about the characteristics and purposes of Intel and AMD processors used for personal computers • Learn about the methods and devices for keeping a system cool • Learn how to install and upgrade a processor • Learn how to solve problems with the processor, the motherboard, overheating, and booting the PC A+ Guide to Managing and Maintaining Your PC, 7e 2 Types and Characteristics of Processors • Processor – Installed on motherboard – Determines system computing power • Two major processor manufacturers – Intel and AMD Figure 6-1 An AMD Athlon 64 X2 installed in socket AM2+ with cooler not yet installed Courtesy: Course Technology/Cengage Learning A+ Guide to Managing and Maintaining Your PC, 7e 3 Types and Characteristics of Processors (cont’d.) • Features affecting processor performance and motherboards – System bus speeds the processor supports – Processor core frequency – Motherboard socket and chipset – Multiprocessing ability – Memory cache – Amount and type of DDR, DDR2, DDR3 memory – Computing technologies the processor can use – Voltage and power consumption A+ Guide to Managing and Maintaining Your PC, 7e 4 How a Processor Works • Three basic components – Input/output (I/O) unit • Manages data and instructions entering and leaving the processor – Control unit • Manages all activities inside the processor – One or more arithmetic logic units (ALUs) • Performs all logical comparisons, calculations A+ Guide to Managing and Maintaining
    [Show full text]
  • Power-Aware Load Balancing of Large Scale MPI Applications
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by UPCommons. Portal del coneixement obert de la UPC Power-Aware Load Balancing Of Large Scale MPI Applications Maja Etinskiy Julita Corbalany Jesus Labartay [email protected] [email protected] [email protected] Mateo Valeroy Alex Veidenbaumz [email protected] [email protected] yBarcelona Supercomputing Center zDepartment of Computer Science Jordi Girona 31, 08034 Barcelona, Spain University of California, Irvine CA Abstract Several solutions have been proposed to load balance ap- plications via processor resource distribution ([1]). Via Dy- Power consumption is a very important issue for HPC namic Voltage Frequency Scaling (DVFS) that enables per community, both at the level of one application or at the processor frequency control it is possible to balance imbal- level of whole workload. Load imbalance of a MPI ap- anced applications in power aware manner. In load imbal- plication can be exploited to save CPU energy without pe- anced MPI applications, there are processes which complete nalizing the execution time. An application is load imbal- their computation and have to wait for the other processes anced when some nodes are assigned more computation to communicate. These nodes can run at lower frequen- than others. The nodes with less computation can be run cies and save energy consumed by CPU without increasing at lower frequency since otherwise they have to wait for the execution time. The dynamic power is proportional to the nodes with more computation blocked in MPI calls. A the product of the frequency and the square of the voltage technique that can be used to reduce the speed is Dynamic and it can be reduced via DVFS technique.
    [Show full text]
  • Dynamic Management of Turbomode in Modern Multi-Core Chips David Lo and Christos Kozyrakis Stanford University {Davidlo, Kozyraki}@Stanford.Edu
    Dynamic Management of TurboMode in Modern Multi-core Chips David Lo and Christos Kozyrakis Stanford University {davidlo, kozyraki}@stanford.edu Abstract core chips from Intel and AMD. It overclocks a cores by utiliz- ing available thermal headroom from idle execution resources Dynamic overclocking of CPUs, or TurboMode, is a feature [12, 1]. Current chips can dynamically overclock a core by up recently introduced on all x86 multi-core chips. It leverages ther- to 40% or more, which can lead to an increase in performance mal and power headroom from idle execution resources to over- by up to 40%. As the number of cores per chip increases over clock active cores to increase performance. TurboMode can ac- time, the overclocking range and the performance implications celerate CPU-bound applications at the cost of additional power of TurboMode will also increase. TurboMode is controlled by consumption. Nevertheless, naive use of TurboMode can signif- firmware using an embedded hardware controller that sets the icantly increase power consumption without increasing perfor- exact clock frequency based on the thermal headroom available mance. Thus far, there is no strategy for managing TurboMode and the expected performance benefits. Software has little con- to optimize its use across all workloads and efficiency metrics. trol over TurboMode, except for the ability to enable/disable it. This paper analyzes the impact of TurboMode on a wide Unfortunately, TurboMode is not always beneficial and de- range of efficiency metrics (performance, power, cost, and com- ciding when to enable it is quite complex. As seen in Section 3, bined metrics such as QPS=W and ED2) for representative the optimal TurboMode setting varies across different applica- server workloads on various hardware configurations.
    [Show full text]
  • A Universal Self-Calibrating Dynamic Voltage and Frequency Scaling (DVFS) Scheme with Thermal Compensation for Energy Savings in Fpgas
    A Universal Self-Calibrating Dynamic Voltage and Frequency Scaling (DVFS) Scheme with Thermal Compensation for Energy Savings in FPGAs Shuze Zhao1, Ibrahim Ahmed1, Carl Lamoureux1, Ashraf Lotfi2, Vaughn Betz1 and Olivier Trescases1 1University of Toronto, Toronto, ON, Canada 10 King’s College Road, Toronto, ON, M5S 3G4, Canada 2Altera Corp., Hampton, NJ, 08827, USA Email: [email protected] Abstract— Field Programmable Gate Arrays (FPGAs) are Currently, FPGA designers operate each IC at its rated widely used in telecom, medical, military and cloud computing nominal voltage, and must choose a clock frequency at or applications. Unlike in microprocessors, the routing and critical below the limit predicted by the Computer-Aided Design path delay of FPGAs is user dependent. The design tool sug- gests a maximum operating frequency based on the worst-case (CAD) tool’s timing analysis. This timing analysis is extremely timing analysis of the critical paths at a fixed nominal voltage, conservative, using worst-case models for process corners, on- which usually means there is significant voltage or frequency chip voltage drop, temperature and aging. In the vast majority margin in a typical chip. This paper presents a universal of chips and systems, however, the supply voltage can be offline self-calibration scheme, which automatically finds the reduced significantly below nominal in order to obtain energy FPGA frequency and core voltage operating limit at different self-imposed temperatures by monitoring design-specific critical savings. Operating the IC at a lower voltage also reduces paths. These operating points are stored in a calibration table the impact of aging effects such as Bias-Threshold Instability and used to dynamically adjust the frequency and core voltage (BTI), and improves the chip lifetime [12], [13].
    [Show full text]
  • 2010 VISION Quick Reference Guide for Desktop
    2010 VISION Quick Reference Guide for Desktop VISION Technology from AMD lets you enjoy a more vivid and smooth visual experience by combining cutting-edge processing and video power. FEATURES AND BENEFITS VISION Technology > VISION Technology from AMD enables reliable performance on today’s most popular applications > Get vivid, lifelike photos and crystal clear streaming video with up to 1 billion colors Maximum control ™ Platform > Featuring ATI Radeon Graphics for a superior visual experience with your favorite Overclocking casual games Massive headroom ATI Eyefinity VISION Premium Technology technology > With next-generation processor technology AMD Direct Connect Architecture, Microsoft® DirectX® you can enjoy a highly responsive PC experience 11 support > Connect your PC to your HDTV and enjoy Blu-ray™ [HD] movies with Create/Edit HD 7.1 surround sound and full 1080p detail with single-cable HDMI output movies Create/Edit HD Create/Edit movies > Speed up the conversion of video files and get fast video editing with movies Watch Blu-ray® ATI Stream Technology1 Watch Blu-ray®/ HD movies > Get dramatic energy efficiency improvements, automatically, with virtually HD movies Media player video no impact on performance with AMD PowerNow!™ 3.0 technology2 Media player video conversion conversion Watch DVD movies VISION Ultimate Technology Watch DVD movies Online videos > Deliver the speed, responsiveness, and performance of ultra-high Online videos Advanced photo bandwidth with power-optimized HyperTransport™ 3.0 technology Advanced photo
    [Show full text]
  • Latency-Aware DVFS for Efficient Power State Transitions on Many
    J Supercomput manuscript No. (will be inserted by the editor) Latency-aware DVFS for Efficient Power State Transitions on Many-core Architectures Zhiquan Lai · King Tin Lam · Cho-Li Wang · Jinshu Su Received: date / Accepted: date Abstract Energy efficiency is quickly becoming a first-class constraint in HPC de- sign. We need more efficient power management solutions to save energy costs and carbon footprint of HPC systems. Dynamic voltage and frequency scaling (DVFS) is a commonly used power management technique for making a trade-off between power consumption and system performance according to the time-varying program behavior. However, prior work on DVFS seldom takes into account the voltage and frequency scaling latencies, which we found to be a crucial factor determining the ef- ficiency of the power management scheme. Frequent power state transitions without latency awareness can make a real impact on the execution performance of applica- tions. The design of multiple voltage domains in some many-core architectures has made the effect of DVFS latencies even more significant. These concerns lead us to propose a new latency-aware DVFS scheme to adjust the optimal power state more ac- curately. Our main idea is to analyze the latency characteristics in depth and design a novel profile-guided DVFS solution which exploits the varying execution and memory access patterns of the parallel program to avoid excessive power state transitions. We implement the solution into a power management library for use by shared-memory parallel applications. Experimental evaluation on the Intel SCC many-core platform shows significant improvement in power efficiency after using our scheme.
    [Show full text]
  • Cost-Efficient Overclocking in Immersion-Cooled Datacenters
    Cost-Efficient Overclocking in Immersion-Cooled Datacenters Majid Jalili∗§, Ioannis Manousakisy, I´nigo˜ Goiri∗, Pulkit A. Misra∗, Ashish Raniwalay, Husam Alissaz, Bharath Ramakrishnanz, Phillip Tumax, Christian Beladyz, Marcus Fontouray, and Ricardo Bianchini∗ ∗Microsoft Research yMicrosoft Azure zMicrosoft CO+I x3M Abstract—Cloud providers typically use air-based solutions for the near future [23], [66]. For example, manufacturers expect cooling servers in datacenters. However, increasing transistor to produce CPUs and GPUs capable of drawing more than counts and the end of Dennard scaling will result in chips 500W in just a few years [23], [31], [66]. with thermal design power that exceeds the capabilities of air cooling in the near future. Consequently, providers have started For these reasons, providers have started to explore liquid to explore liquid cooling solutions (e.g., cold plates, immersion cooling solutions (e.g., cold plates, liquid immersion) for their cooling) for the most power-hungry workloads. By keeping the most power-hungry workloads [3], [20], [68], [74]. These servers cooler, these new solutions enable providers to operate technologies keep chip temperatures at a lower and narrower server components beyond the normal frequency range (i.e., range than air cooling, reducing leakage power, eliminating overclocking them) all the time. Still, providers must tradeoff the increase in performance via overclocking with its higher power the need for fans, and reducing datacenter Power Usage draw and any component reliability implications. Effectiveness (PUE), i.e. the ratio of total power to IT (e.g., In this paper, we argue that two-phase immersion cooling server, networking) power. For example, Google cools its (2PIC) is the most promising technology, and build three pro- Tensor Processing Units (TPUs) with cold plates [25], [52].
    [Show full text]
  • Dynamic Voltage and Frequency Scaling with Multi-Clock Distribution Systems on SPARC Core" (2009)
    Rochester Institute of Technology RIT Scholar Works Theses Thesis/Dissertation Collections 5-1-2009 Dynamic voltage and frequency scaling with multi- clock distribution systems on SPARC core Michael Nasri Michael Follow this and additional works at: http://scholarworks.rit.edu/theses Recommended Citation Michael, Michael Nasri, "Dynamic voltage and frequency scaling with multi-clock distribution systems on SPARC core" (2009). Thesis. Rochester Institute of Technology. Accessed from This Thesis is brought to you for free and open access by the Thesis/Dissertation Collections at RIT Scholar Works. It has been accepted for inclusion in Theses by an authorized administrator of RIT Scholar Works. For more information, please contact [email protected]. Dynamic Voltage and Frequency Scaling with Multi-Clock Distribution Systems on SPARC Core by Michael Nasri Michael A Thesis Submitted in Partial Fulfillment of the Requirements for the Degree of Master of Science in Computer Engineering Supervised by Dr. Dhireesha Kudithipudi Department of Computer Engineering Kate Gleason College of Engineering Rochester Institute of Technology Rochester, NY May 2009 Approved By: Dr. Dhireesha Kudithipudi Assistant Professor, RIT Department of Computer Engineering Primary Advisor Dr. Ken Hsu Professor, RIT Department of Computer Engineering Dr. Muhammad Shaaban Associate Professor, RIT Department of Computer Engineering Thesis Release Permission Form Rochester Institute of Technology Kate Gleason College of Engineering Title: Dynamic Voltage and Frequency Scaling with Multi-Clock Distribution Systems on SPARC Core I, Michael Nasri Michael, hereby grant permission to the Wallace Memorial Library to reproduce my thesis in whole or part. Michael Nasri Michael Date Dedication To my parents for their pride and encouragement To my brother and sisters for their love and support Acknowledgements Thanks to Dr.
    [Show full text]
  • Overclocking Assistant (Performance Tuning Guide) for Intel® NUC Kit Nuc8i7hnk and Nuc8i7hvk Objective Memory Overclock Perfor
    Overclocking Assistant (Performance Tuning Guide) for Intel® NUC Kit NUC8i7HNK and NUC8i7HVK June 2018 WARNING Altering PC memory frequency and/or voltage may (i) reduce system stability and useful life of the system, memory, and processor; (ii) cause the processor and other system components to fail; (iii) cause reductions in system performance; (iv) cause additional heat or other damage; and (v) affect system data integrity. Intel assumes no responsibility that the memory included, if used with altered clock frequencies and/or voltages, will be fit for any particular purpose. Check with the memory manufacturer for warranty and additional details. Objective The Purpose of this performance tuning guide is intended for validation and system characterization purposes only. Memory Overclock Performance Memory performance tuning is controlled via several platform mechanisms. Beyond this, the platform is also capable of supporting manual overrides to the memory frequency, timings, and voltage levels of both the integrated memory controller (iMC) and the DIMMs. If you’re going to try overclocking the module, you’ll have to decide which is more important – the speed or the timings. In most cases, you can’t have both. Follow the settings below if you want to overclock your memory. Note: some memory modules don’t allow any overclocking. DDR4 Frequency Memory Reference Multiplier Memory QCLK (133MHZ/100MHz) Multiplier 1600 1.33 12 1600 1 16 1700 1 17 1733 1.33 13 1800 1 18 1867 1.33 14 1900 1 19 2000 1.33 15 2000 1 20 2100 1 21 2133 1.33 16 2200 1 22 2267 1.33 17 2300 1 23 2400 1.33 18 2400 1 24 2500 1 25 2533 1.33 19 2667 1.33 20 2700 1 27 2 2800 1.33 21 2800 1 28 2900 1 29 2933 1.33 22 3000 1 30 3067 1.33 23 3100 1 31 3200 1.33 24 Manual Memory Overclocking Example Example: You have a DDR4 2133 module and want to overclock to 2267 MHz.
    [Show full text]