SQL Server 2005 on the IBM Eserver Xseries 460 Enterprise Server

Copy Link

IBM Front cover SQL Server 2005 on the IBM Eserver xSeries 460 Enterprise Server Describes how “scale-up” solutions suit enterprise databases Explains the XpandOnDemand features of the xSeries 460 Introduces the features of the new SQL Server 2005 Michael Lawson Yuichiro Tanimoto David Watts ibm.com/redbooks Redpaper International Technical Support Organization SQL Server 2005 on the IBM Eserver xSeries 460 Enterprise Server December 2005 Note: Before using this information and the product it supports, read the information in “Notices” on page vii. First Edition (December 2005) This edition applies to Microsoft SQL Server 2005 running on the IBM Eserver xSeries 460. © Copyright International Business Machines Corporation 2005. All rights reserved. Note to U.S. Government Users Restricted Rights -- Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents Notices . vii Trademarks . viii Preface . ix The team that wrote this Redpaper . ix Become a published author . .x Comments welcome. xi Chapter 1. The x460 enterprise server . 1 1.1 Scale up versus scale out . 2 1.2 X3 Architecture . 2 1.2.1 The X3 Architecture servers . 2 1.2.2 Key features . 3 1.2.3 IBM XA-64e third-generation chipset . 4 1.2.4 XceL4v cache . 6 1.3 XpandOnDemand . 6 1.4 Memory subsystem . 7 1.5 Multi-node configurations . 9 1.5.1 Positioning the x460 and the MXE-460. 11 1.5.2 Node connectivity . 12 1.5.3 Partitioning . 13 1.6 64-bit with EM64T . 14 1.6.1 What makes a 64-bit processor . 14 1.6.2 Intel Xeon with EM64T . 16 1.7 Dual core processors . 17 1.7.1 Why dual core processors. 17 1.7.2 Performance . 18 1.7.3 Software licensing. 19 1.7.4 Comparing with single core. 19 Chapter 2. SQL Server 2005 . 21 2.1 SQL Server 2005 editions . 22 2.2 New and enhanced features of SQL Server 2005 . 22 2.2.1 Database Engine enhancements . 22 2.2.2 Analysis Services enhancements . 23 2.2.3 Additional enhancements and features. 24 2.3 Windows, SQL Server, and 32-bit versus 64-bit . 25 2.3.1 Windows and SQL Server, both 32-bit . 25 2.3.2 Windows 64-bit and SQL Server 32-bit. 26 2.3.3 Windows and SQL Server 2005, both 64-bit. 27 2.4 Windows Server 2003 editions . 27 2.4.1 Comparing Windows Server 2003 editions. 27 2.4.2 Windows Datacenter models . 28 2.4.3 Processor and memory limits . 28 2.5 SQL Server 2005 high availability . 29 2.5.1 Database mirroring . 29 2.5.2 Data partitioning . 30 2.5.3 Support for hot-add memory . 32 2.6 Customer proof of concept . 32 © Copyright IBM Corp. 2005. All rights reserved. iii 2.6.1 Hardware configuration. 32 2.6.2 Software and storage configuration . 33 2.6.3 Results . 34 Chapter 3. Scalability and affinity . 35 3.1 Scalable hardware implementation . 36 3.2 Static Resource Affinity Table . 39 3.3 Affinity in Windows Server 2003 . 39 3.3.1 NUMA optimization for Windows Server 2003 . 39 3.3.2 Process and thread scheduling. 40 3.4 Affinity in SQL Server 2005. 41 3.4.1 SQL Server Operating System . 41 3.4.2 Processor and I/O affinity . 42 3.4.3 Network affinity . 43 3.4.4 Soft NUMA . 46 3.4.5 Memory . 48 3.5 Multiple instances . 49 3.5.1 Resource contention . 49 3.5.2 Clustering issues. 50 3.5.3 Performance . 51 3.5.4 Single or multiple . 51 3.6 Server consolidation . 53 3.6.1 General notion of consolidation . 53 3.6.2 Database server consolidation . ..

Recommended publications

Support for NUMA Hardware in Helenos

Charles University in Prague Faculty of Mathematics and Physics MASTER THESIS Vojtˇech Horký Support for NUMA hardware in HelenOS Department of Distributed and Dependable Systems Supervisor: Mgr. Martin Dˇecký Study Program: Software Systems 2011 I would like to thank my supervisor, Martin Dˇecký,for the countless advices and the time spent on guiding me on the thesis. I am grateful that he introduced HelenOS to me and encouraged me to add basic NUMA support to it. I would also like to thank other developers of HelenOS, namely Jakub Jermáˇrand Jiˇr´ı Svoboda, for their assistance on the mailing list and for creating HelenOS. I would like to thank my family and my closest friends too { for their patience and continuous support. I declare that I carried out this master thesis independently, and only with the cited sources, literature and other professional sources. I understand that my work relates to the rights and obligations under the Act No. 121/2000 Coll., the Copyright Act, as amended, in particular the fact that the Charles University in Prague has the right to conclude a license agreement on the use of this work as a school work pursuant to Section 60 paragraph 1 of the Copyright Act. Prague, August 2011 Vojtˇech Horký 2 Názevpráce:Podpora NUMA hardwaru pro HelenOS Autor: Vojtˇech Horký Katedra (ústav): Matematicko-fyzikáln´ıfakulta univerzity Karlovy Vedouc´ıpráce:Mgr. Martin Dˇecký e-mail vedouc´ıho:[email protected]ff.cuni.cz Abstrakt: C´ılemtétodiplomovépráceje rozˇs´ıˇritoperaˇcn´ısystémHelenOS o podporu ccNUMA hardwaru.
A Novel Hard-Soft Processor Affinity Scheduling for Multicore Architecture Using Multiagents

European Journal of Scientific Research ISSN 1450-216X Vol.55 No.3 (2011), pp.419-429 © EuroJournals Publishing, Inc. 2011 http://www.eurojournals.com/ejsr.htm A Novel Hard-Soft Processor Affinity Scheduling for Multicore Architecture using Multiagents G. Muneeswari Research Scholar, Computer Science and Engineering Department RMK Engineering College, Anna University-Chennai, Tamilnadu, India E-mail: [email protected] K. L. Shunmuganathan Professor & Head Computer Science and Engineering Department RMK Engineering College, Anna University-Chennai, Tamilnadu, India E-mail: [email protected] Abstract Multicore architecture otherwise called as SoC consists of large number of processors packed together on a single chip uses hyper threading technology. This increase in processor core brings new advancements in simultaneous and parallel computing. Apart from enormous performance enhancement, this multicore platform adds lot of challenges to the critical task execution on the processor cores, which is a crucial part from the operating system scheduling point of view. In this paper we combine the AMAS theory of multiagent system with the affinity based processor scheduling and this will be best suited for critical task execution on multicore platforms. This hard-soft processor affinity scheduling algorithm promises in minimizing the average waiting time of the non critical tasks in the centralized queue and avoids the context switching of critical tasks. That is we assign hard affinity for critical tasks and soft affinity for non critical tasks. The algorithm is applicable for the system that consists of both critical and non critical tasks in the ready queue. We actually modified and simulated the linux 2.6.11 kernel process scheduler to incorporate this scheduling concept.
Agent Based Load Balancing Scheme Using Affinity Processor Scheduling for Multicore Architectures

WSEAS TRANSACTIONS on COMPUTERS G. Muneeswari, K. L. Shunmuganathan Agent Based Load Balancing Scheme using Affinity Processor Scheduling for Multicore Architectures G.MUNEESWARI Research Scholar Department of computer Science and Engineering RMK Engineering College, Anna University (Chennai) INDIA [email protected] http://www.rmkec.ac.in Dr.K.L.SHUNMUGANATHAN Professor & Head Department of computer Science and Engineering RMK Engineering College, Anna University (Chennai) INDIA [email protected] http://www.rmkec.ac.in Abstract:-Multicore architecture otherwise called as CMP has many processors packed together on a single chip utilizes hyper threading technology. The main reason for adding large amount of processor core brings massive advancements in parallel computing paradigm. The enormous performance enhancement in multicore platform injects lot of challenges to the task allocation and load balancing on the processor cores. Altogether it is a crucial part from the operating system scheduling point of view. To envisage this large computing capacity, efficient resource allocation schemes are needed. A multicore scheduler is a resource management component of a multicore operating system focuses on distributing the load of some highly loaded processor to the lightly loaded ones such that the overall performance of the system is maximized. We already proposed a hard-soft processor affinity scheduling algorithm that promises in minimizing the average waiting time of the non critical tasks in the centralized queue and avoids the context switching of critical tasks. In this paper we are incorporating the agent based load balancing scheme for the multicore processor using the hard-soft processor affinity scheduling algorithm. Since we use the actual round robin scheduling for non critical tasks and due to soft affinity the load balancing is done automatically for non critical tasks.
Optimizing Linux* for Dual-Core AMD Opteron™ Processors Optimizing Linux for Dual-Core AMD Opteron Processors

Technical White Paper www.novell.com DATA CENTER Optimizing Linux* for Dual-Core AMD Opteron™ Processors Optimizing Linux for Dual-Core AMD Opteron Processors Table of Contents: 2 . SUSE Linux Enterprise and the 7 . How Can Applications Be Tuned AMD Opteron Platform for Dual-core? 2 . Dual-processor, Dual-core and 7 . What Tuning Is Possible to Control Multi-core Definitions Performance on a System Using Dual-core CPUs? 2 . Dual-core Background 7 . How Does the Linux 2.6 CPU 2 . AMD Specific Scheduler Handle Dual-core CPUs? . NUMA Background 3 8 . What User Mechanisms, Commands or System Calls (e.g., , 3 . NUMA Memory Access and taskset , etc.) Are Available to Allocation in Linux Basics numactl Allow Application Control on a 3 . What Are Characteristics of System with Dual-core CPUs? Workloads that Would Be Helped . How Does Dual-core Interact with by Using Dual-core CPUs? 9 AMD PowerNow! Technology? 4 . What Class of Parallel Workloads . How Does Dual-core Interact with Can Specifically Take Advantage 9 I/O and Async I/O? of Dual-core Systems? 9 . Should Hyper-threading Be Disabled 4 . How Can I Tell If My Workload or on Dual-core? Application Would Benefit? 10 . Conclusion 4 . What Tools Exist to Tell Me Whether My Application Would Benefit? 11 . Glossary p. 1 SUSE® Linux Enterprise and the AMD Opteron™ Platform Dual Core technologies are the latest evolu- core processor appears much the same as tion in processor design. Novell® enterprise a traditional multiprocessor system with two products are ready to take advantage of the single core chips (symmetric multiprocess- state of the art AMD64 dual-core technology.
Analysis of Application Sensitivity to System Performance Variability in a Dynamic Task Based Runtime

Analysis of Application Sensitivity to System Performance Variability in a Dynamic Task Based Runtime Galen Shipmany Kevin Pedretti Ramanan Sankaran Patrick McCormick Stephen Olivier Oak Ridge National Laboratory Los Alamos National Laboratory Kurt B. Ferreira Oak Ridge, TN, USA Los Alamos, NM, USA Sandia National Laboratories Albuquerque, NM, USA Sean Treichler Michael Bauer Alex Aiken NVIDIA Stanford University Santa Clara, CA Stanford, CA, USA ABSTRACT tem thereby providing a level of isolation of system services Application scalability can be significantly impacted by node from the application. Management of resource contention level performance variability in HPC. While previous studies at the application level is handled by the application de- have demonstrated the impact of one source of variability, veloper. Many application developers opt for a static par- OS noise, in message passing runtimes, none have explored titioning of resources mirroring domain decomposition due the impact on dynamically scheduled runtimes. In this pa- to its simplicity. However, this approach leaves many HPC per we examine the impact that OS noise has on the Le- applications vulnerable to the effects of OS noise, especially gion runtime. Our work shows that 2:5% net performance at scale. variability at the node level can result in 25% application The growing effects of OS noise has been one of the con- slowdown for MPI+OpenACC based runtimes compared to tributing factors in the development of dynamic runtime 2% slowdown for Legion. We then identify the mechanisms systems such as Charm++ [1], the Open Community Run- that contribute to better noise absorption in Legion, quan- time [2], Uintah [3], StarSs [4] and Legion [5].
Introduction to Openmp

COSC 6374 Parallel Computation Introduction to OpenMP Edgar Gabriel Fall 2012 Edgar Gabriel Introduction • Threads vs. processes • Recap of shared memory systems – SMP vs. NUMA – Cache Coherence – Process and thread affinity • Operating system scheduling – Load balancing across multiple processors Parallel Computation Edgar Gabriel 1 Processes vs. Threads • Process: – an address space with 1 or more threads executing within that address space, and the required system resources for those threads – a program that is running • Thread: – a sequence of instructions executed within a process – shares the resources in that process Parallel Computation Edgar Gabriel Processes vs. Threads (II) • Advantages of multi-threaded programming: • easier programming than multi-process models • lower overhead for creating a thread compared to creating a process • lower overhead for switching between threads required by the OS compared to switching between processes • Drawbacks of multi-threaded programming: • more difficult to debug than single threaded programs • for single processor machines, creating several threads in a program may not necessarily produce an increase in performance Parallel Computation Edgar Gabriel 2 Processes vs. Threads (III) Data per process Data per thread Address space Programcounter Open files Processor register Child processes Processor status Signal handler Signal mask Timer Stack Accounting Parallel Computation Edgar Gabriel Execution model • Main thread: – initial thread created when main() (in C) invoked by the process loader – once in main(), the application can create additional threads – exit() (or return() called in the main function) will terminate all threads • E.g. the process terminates even if there are still running threads • Threads can be terminated separately by using different functions Parallel Computation Edgar Gabriel 3 Threads vs.
White Paper Processor Affinity Multiple CPU Scheduling

TMurgent Technologies White Paper Processor Affinity Multiple CPU Scheduling November 3, 2003 White Paper: Processor Affinity 1/16 © TMurgent Technologies Introduction1 The purpose of this paper is to examine the role of “processor affinity”. In this paper I will explain: • What processor affinity is. • How the operating system uses processor affinity. • Situations when manipulation of processor affinity has been used. Finally, I will present TMurgent Technologies view on the appropriate and in- appropriate manipulation of processor affinity on Windows 2000 Server and Server 2003. Before we begin a note of caution: Playing with processor affinity on a production system should never be done. We often find that otherwise intelligent human beings learn “just enough to be dangerous” and try to use that knowledge. We strongly recommend that first, you read the entire paper – especially our view on in-appropriate manipulation of processor affinity. Second, after you have done that, if you are still excited about playing with processor affinity please do it on a test system that is not in production. 1 It is inevitable that a technical paper such as this will use terms that are Trademarks of other companies. Microsoft, Windows, SQL Server, IIS, and a variety of operating system names are trademarks of Microsoft Corporation. Citrix, MetaFrame, ICA, and SpeedScreen are trademarks of Citrix Systems. SysInternals is possibly a trademark of WinInternals. TMuLimit is a trademark of TMurgent Technologies. White Paper: Processor Affinity 2/16 © TMurgent Technologies I - What is “Processor Affinity” In a multi-tasking operating system, the work to be performed is broken into discrete components, called tasks.
Affinity Mask Recommended Value: 0 to Carry out Multitasking, Microsoft Windows 2000 and Windows Server 2003 Sometimes Move Process Threads Among Different Processors

Product Version Theme Environment Date Department Person Language SEE Electrical PLM V4Rx Database server Windows After Sales Technical Support EN Subject: SQL servers recommended configuration Description: This document describes the recommendations for the configuration of the Microsoft SQL server. Options list is valid for Microsoft SQL Server versions 2008 to 2014. Our proposed solution: 1 - Server level options To obtain your general server configuration options, execute: “SELECT * FROM sys.configurations ORDER BY name” The recommended values are provided in the form of the system view above. The option's description is taken from Books Online. 1.1 - Access check cache bucket count Recommended value: 0 The default value of 0 indicates that SQL Server is managing these options. Microsoft recommends only changing these options when directed by Microsoft Customer Support Services. 1.2 - Access check cache quota Recommended value: 0 The default values of 0 indicates that SQL Server is managing these options. Microsoft recommends only changing these options when directed by Microsoft Customer Support Services. 1.3 - Affinity I/O mask Recommended value: 0 1/10 The affinity I/O mask option binds SQL Server disk I/O to a specified subset of CPUs. In high-end SQL Server online transactional processing (OLTP) environments, this extension can enhance the performance of SQL Server threads issuing I/Os. 1.4 - Affinity mask Recommended value: 0 To carry out multitasking, Microsoft Windows 2000 and Windows Server 2003 sometimes move process threads among different processors. Although efficient from an operating system point of view, this activity can reduce SQL Server performance under heavy system loads, as each processor cache is repeatedly reloaded with data.
Red Hat Enterprise Linux 7 Performance Tuning Guide

Red Hat Enterprise Linux 7 Performance Tuning Guide Monitoring and optimizing subsystem throughput in RHEL 7 Last Updated: 2021-08-31 Red Hat Enterprise Linux 7 Performance Tuning Guide Monitoring and optimizing subsystem throughput in RHEL 7 Milan Navrátil Red Hat Customer Content Services Laura Bailey Red Hat Customer Content Services Charlie Boyle Red Hat Customer Content Services Edited by Marek Suchánek Red Hat Customer Content Services [email protected] Legal Notice Copyright © 2018 Red Hat, Inc. This document is licensed by Red Hat under the Creative Commons Attribution-ShareAlike 3.0 Unported License. If you distribute this document, or a modified version of it, you must provide attribution to Red Hat, Inc. and provide a link to the original. If the document is modified, all Red Hat trademarks must be removed. Red Hat, as the licensor of this document, waives the right to enforce, and agrees not to assert, Section 4d of CC-BY-SA to the fullest extent permitted by applicable law. Red Hat, Red Hat Enterprise Linux, the Shadowman logo, the Red Hat logo, JBoss, OpenShift, Fedora, the Infinity logo, and RHCE are trademarks of Red Hat, Inc., registered in the United States and other countries. Linux ® is the registered trademark of Linus Torvalds in the United States and other countries. Java ® is a registered trademark of Oracle and/or its affiliates. XFS ® is a trademark of Silicon Graphics International Corp. or its subsidiaries in the United States and/or other countries. MySQL ® is a registered trademark of MySQL AB in the United States, the European Union and other countries.
Nighttune User's Guide

NightTune User’s Guide Version 3.4 (RedHawkTM Linux®) 0898515-041 June 2011 Copyright 2011 by Concurrent Computer Corporation. All rights reserved. This publication or any part thereof is intended for use with Concurrent products by Concurrent personnel, customers, and end–users. It may not be reproduced in any form without the written permission of the pub- lisher. The information contained in this document is believed to be correct at the time of publication. It is subject to change without notice. Concurrent Computer Corporation makes no warranties, expressed or implied, concerning the information contained in this document. To report an error or comment on a specific portion of the manual, photocopy the page in question and mark the correction or comment on the copy. Mail the copy (and any additional comments) to Concurrent Computer Corporation, 2881 Gateway Drive, Pompano Beach, FL 33069-4324. Mark the envelope “Attention: Publications Department.” This publication may not be reproduced for any other reason in any form without written permission of the publisher. Concurrent Computer Corporation and its logo are registered trademarks of Concurrent Computer Corporation. All other Concurrent product names are trademarks of Concurrent while all other product names are trademarks or registered trademarks of their respective owners. Linux® is used pursuant to a sublicense from the Linux Mark Institute. NightStar’s integrated help system is based on Assistant, a QtTM utility. Qt is a trademark of Nokia Corporation. CUDATM is a trademark of NVIDIA Corporation. Preface Scope of Manual This guide is designed to assist you in getting started with use of NightTuneTM, a process and system analysis and tuning tool.
Multi-Core Object-Oriented Real-Time Pipeline

MULTI-CORE OBJECT-ORIENTED REAL-TIME PIPELINE FRAMEWORK by TROY S WOLZ A THESIS Submitted in partial fulfillment of the requirements for the degree of Master of Science in Engineering in The Department of Electrical and Computer Engineering to The School of Graduate Studies of The University of Alabama in Huntsville HUNTSVILLE, ALABAMA 2014 TABLE OF CONTENTS Page List of Figures ix List of Tables xi Chapter 1 Introduction 1 2 Literature Review 6 2.1 Real Time Operating System (RTOS) Options . 6 2.1.1 Hard Real-Time vs. Soft Real-Time . 7 2.1.2 Hard RTOS . 8 2.2 Bound Multiprocessing (BMP) . 9 2.3 Hardware vs Software Pipelining . 11 2.4 FastFlow . 13 2.5 MORPF and the OS . 13 2.6 Bringing it all Together . 14 3 Software Framework 15 3.1 Overview . 15 3.1.1 Real-Time Considerations . 17 v 3.1.2 Thread Creation . 17 3.1.3 Software Hierarchy . 18 3.1.4 Algorithmic Requirements for MORPF . 19 3.2 Creating a Pipeline . 20 3.2.1 Fan-Out and Fan-In . 21 3.3 Task Synchronization . 23 3.3.1 Data Exchange Buffers . 24 3.3.2 One-to-M Exchange . 27 3.3.3 M-to-One Exchange . 27 3.4 Differences betweeen MORPF and FastFlow . 30 4 Operating System Progression 33 4.1 Overview . 33 4.2 Synthetic Testing Benchmark . 33 4.3 Demonstrating the Need for MORPF . 35 4.4 Proving the Need for an RTOS . 36 4.5 The Kernel Tick . 36 4.6 Linux 3.10 with Xenomai .
Thread Scheduling and Related Interfaces in Netbsd 5.0

Thread scheduling and related interfaces in NetBSD 5.0 Mindaugas Rasiukevicius The NetBSD Project rmind at netbsd org May 4, 2009 Introduction A lot of new features were implemented in the NetBSD 5.0 release, and many improvements were made in the areas of scheduling, threading and symmetric multiprocessing (SMP). Like other modern UNIX-like operating systems, NetBSD supports traditional processes created by fork(2) and native POSIX threads (pthreads). Prior to the 5.0 release, user threading on NetBSD was implemented using a mechanism called scheduler activations (SA). The SA implementation was complicated, scaled poorly on multiprocessor systems and had no support for real-time applications. In NetBSD 5.0 these deﬁciencies have been addressed by replacing SA with an entirely new, scalable 1:1 threading model. In a 1:1 model each user thread (pthread) has a kernel thread called a light-weight process (LWP). Inside the kernel, both processes and threads are implemented as LWPs, and are served the same by the scheduler, as in Solaris, and other systems. In this article, we will review the scheduling of threads, related application programming interfaces, and utilities in the NetBSD operating system. Since the focus of this article is to introduce readers to new interfaces, we will not go into the details of implementations inside the kernel. Real-time and scheduling classes A real-time system is a predictable system which aims to meet certain time constraints (deadlines). Failure to meet these time constraints usually indicates hardware failure. Systems can be classiﬁed as either hard or soft real-time systems.