User-Space Process Virtualization in the Context of Checkpoint-Restart and Virtual Machines
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  Iocage Documentation Release 1.2iocage Documentation Release 1.2 Brandon Schneider and Peter Toth Sep 26, 2019 Contents 1 Documentation: 3 1.1 Install iocage...............................................3 1.2 Basic Usage...............................................6 1.3 Plugins.................................................. 10 1.4 Networking................................................ 11 1.5 Jail Types................................................. 14 1.6 Best Practices............................................... 15 1.7 Advanced Usage............................................. 16 1.8 Using Templates............................................. 20 1.9 Create a Debian Squeeze Jail (GNU/kFreeBSD)............................ 21 1.10 Known Issues............................................... 22 1.11 FAQ.................................................... 23 1.12 Indices and tables............................................ 24 Index 25 i ii iocage Documentation, Release 1.2 iocage is a jail/container manager written in Python, combining some of the best features and technologies the FreeBSD operating system has to offer. It is geared for ease of use with a simplistic and easy to learn command syntax. FEATURES: • Templates, basejails, and normal jails • Easy to use • Rapid thin provisioning within seconds • Automatic package installation • Virtual networking stacks (vnet) • Shared IP based jails (non vnet) • Dedicated ZFS datasets inside jails • Transparent ZFS snapshot management • Binary updates • Export and import • And many more! Contents 1 iocage Documentation,
- 
												  Freebsd's Jail(2) FacilityLousy virtualization, Happy users: FreeBSD's jail(2) facility Poul-Henning Kamp [email protected] CHROOT(2) FreeBSD System Calls Manual CHROOT(2) NAME chroot -- change root directory LIBRARY Standard C Library (libc, -lc) SYNOPSIS #include <unistd.h> int chroot(const char *dirname); Calling chroot(2) in ftpd(1) implemented ”anonymous FTP” without the hazzle of file/pathname parsing and editing. ”anonymous FTP” became used as a tool to enhance network security. By inference, chroot(2) became seen as a security enhancing feature. ...The source were not strong in those. Exercise 1: List at least four ways to escape chroot(2). Then the Internet happened, ...and web-servers, ...and web-hosting Virtual hosts in Apache User get their own ”virtual apache” but do do not get your own machine. Also shared: Databases mailprograms PHP/Perl etc. Upgrading tools (PHP, mySQL etc) on virtual hosting machines is a nightmare. A really bad nightmare: Cust#1 needs mySQL version > N Cust#2 cannot use mySQL version <M (unless PHP version > K) Cust#3 does not answer telephone Cust#4 has new sysadmin Cust#5 is just about ready with new version Wanted: Lightweight virtualization Same kernel, but virtual filesystem and network address plus root limitations. Just like chroot(2) with IP numbers on top. Will pay cash. Close holes in chroot(2) Introduce ”jail” syscall + kernel struct Block jailed root in most suser(9) calls. Check ”if jail, same jail ?” in strategic places. Fiddle socket syscall arguments: INADDR_ANY -> jail.ip INADDR_LOOPBACK -> jail.ip Not part of jail(2): Resource restriction Hardware virtualization Covert channel prevention (the hard stuff) Total implementation: 350 changed source lines 400 new lines of code FreeBSD without jail usr Resources of various sorts / home var process process process process process process Kernel FreeBSD with jail usr Resources of various sorts / home var process process process process* process process Kernel error = priv_check_cred( cred, PRIV_VFS_LINK, SUSER_ALLOWJAIL); if (error) return (error); The unjailed part One jailed part of the system.
- 
												  Sandboxing 2 Change Root: Chroot()Sandboxing 2 Change Root: chroot() Oldest Unix isolation mechanism Make a process believe that some subtree is the entire file system File outside of this subtree simply don’t exist Sounds good, but. Sandboxing 2 2 / 47 Chroot Sandboxing 2 3 / 47 Limitations of Chroot Only root can invoke it. (Why?) Setting up minimum necessary environment can be painful The program to execute generally needs to live within the subtree, where it’s exposed Still vulnerable to root compromise Doesn’t protect network identity Sandboxing 2 4 / 47 Root versus Chroot Suppose an ordinary user could use chroot() Create a link to the sudo command Create /etc and /etc/passwd with a known root password Create links to any files you want to read or write Besides, root can escape from chroot() Sandboxing 2 5 / 47 Escaping Chroot What is the current directory? If it’s not under the chroot() tree, try chdir("../../..") Better escape: create device files On Unix, all (non-network) devices have filenames Even physical memory has a filename Create a physical memory device, open it, and change the kernel data structures to remove the restriction Create a disk device, and mount a file system on it. Then chroot() to the real root (On Unix systems, disks other than the root file system are “mounted” as a subtree somewhere) Sandboxing 2 6 / 47 Trying Chroot # mkdir /usr/sandbox /usr/sandbox/bin # cp /bin/sh /usr/sandbox/bin/sh # chroot /usr/sandbox /bin/sh chroot: /bin/sh: Exec format error # mkdir /usr/sandbox/libexec # cp /libexec/ld.elf_so /usr/sandbox/libexec # chroot /usr/sandbox
- 
												  April 2006 Volume 31 Number 2APRIL 2006 VOLUME 31 NUMBER 2 THE USENIX MAGAZINE OPINION Musings RIK FARROW OpenSolaris:The Model TOM HAYNES PROGRAMMING Code Testing and Its Role in Teaching BRIAN KERNIGHAN Modular System Programming in MINIX 3 JORRIT N. HERDER, HERBERT BOS, BEN GRAS, PHILIP HOMBURG, AND ANDREW S. TANENBAUM Some Types of Memory Are More Equal Than Others DIOMEDIS SPINELLIS Simple Software Flow Analysis Using GNU Cflow CHAOS GOLUBITSKY Why You Should Use Ruby LU KE KANIES SYSADMIN Unwanted HTTP:Who Has the Time? DAVI D MALONE Auditing Superuser Usage RANDOLPH LANGLEY C OLUMNS Practical Perl Tools:Programming, Ho Hum DAVID BLANK-EDELMAN VoIP Watch HEISON CHAK /dev/random ROBERT G. FERRELL STANDARDS USENIX Standards Activities NICHOLAS M. STOUGHTON B O OK REVIEWS Book Reviews ELIZABETH ZWICKY, WITH SAM STOVER AND RI K FARROW USENIX NOTES Letter to the Editor TED DOLOTTA Fund to Establish the John Lions Chair C ONFERENCES LISA ’05:The 19th Large Installation System Administration Conference WORLDS ’05: Second Workshop on Real, Large Distributed Systems FAST ’05: 4th USENIX Conference on File and Storage Technologies The Advanced Computing Systems Association Upcoming Events 3RD SYMPOSIUM ON NETWORKED SYSTEMS 2ND STEPS TO REDUCING UNWANTED TRAFFIC ON DESIGN AND IMPLEMENTATION (NSDI ’06) THE INTERNET WORKSHOP (SRUTI ’06) Sponsored by USENIX, in cooperation with ACM SIGCOMM JULY 6–7, 2006, SAN JOSE, CA, USA and ACM SIGOPS http://www.usenix.org/sruti06 MAY 8–10, 2006, SAN JOSE, CA, USA Paper submissions due: April 20, 2006 http://www.usenix.org/nsdi06 2006
- 
												  In PDF FormatArranging Your Virtual Network on FreeBSD Michael Gmelin ([email protected]) January 2020 1 CONTENTS CONTENTS Contents Introduction 3 Document Conventions . .3 License . .3 Plain Jails 4 Plain Jails Using Inherited IP Configuration . .4 Plain Jails Using a Dedicated IP Address . .5 Plain Jails Using a VLAN IP Address . .6 Plain Jails Using a Loopback IP Address . .7 Adding Outbound NAT for Public Traffic . .7 Running a Service and Redirecting Traffic to It . .9 VNET Jails and bhyve VMs 10 VNET Jails Using sysutils/pot ............................. 10 VNET Jails Using sysutils/iocage ............................ 13 Managing Bridges . 13 Adding bhyve VMs and DHCP to the Mix . 16 Preventing Traffic Between VNET Jails/VMs . 17 Firewalling Inside VNET Jails/VMs . 20 VXLAN 22 VXLAN Example Overview . 22 Gateway Configuration . 24 Jailhost-a . 25 Network Configuration (jailhost-a) . 25 VM Configuration (jailhost-a) . 26 Jail Configuration (jailhost-a) . 27 Jailhost-b . 27 Network Configuration (jailhost-b) . 27 Plain Jail Configuration (jailhost-b) . 28 Network Switch Setup (jailhost-b) . 29 VNET Jail Configuration (jailhost-b) . 30 VM Configuration (jailhost-b) . 30 VXLAN Multicast Troubleshooting . 31 Conclusion and Further Reading 33 2020-01-08 (final) 2 CC BY 4.0 INTRODUCTION Introduction Modern FreeBSD offers a range of virtualization options, from the traditional jail environment sharing the network stack with the host operating system, over vnet jails, which allow each jail to have its own network stack, to bhyve virtual machines running their own kernels/operating systems. Depending on individual requirements, there are different ways to configure the virtual network. Jail and VM management tools can ease the process by abstracting away (at least some of) the underlying complexities.
- 
												  Running Untrusted Application Code: Sandboxing Running Untrusted CodeRunning Untrusted Application Code: Sandboxing Running untrusted code We often need to run buggy/unstrusted code: programs from untrusted Internet sites: toolbars, viewers, codecs for media player old or insecure applications: ghostview, outlook legacy daemons: sendmail, bind honeypots Goal : if application “misbehaves,” kill it Approach: confinement Confinement : ensure application does not deviate from pre-approved behavior Can be implemented at many levels: Hardware : run application on isolated hw (air gap) difficult to manage Virtual machines : isolate OS’s on single hardware System call interposition : Isolates a process in a single operating system Isolating threads sharing same address space: Software Fault Isolation (SFI) Application specific: e.g. browser-based confinement Implementing confinement Key component: reference monitor Mediates requests from applications Implements protection policy Enforces isolation and confinement Must always be invoked: Every application request must be mediated Tamperproof : Reference monitor cannot be killed … or if killed, then monitored process is killed too Small enough to be analyzed and validated A simple example: chroot Often used for “guest” accounts on ftp sites To use do: (must be root) chroot /tmp/guest root dir “/” is now “/tmp/guest” su guest EUID set to “guest” Now “/tmp/guest” is added to file system accesses for applications in jail open(“/etc/passwd”, “r”) ⇒⇒⇒ open(“/tmp/guest/etc/passwd”, “r”) ⇒ application cannot access files outside of jail Jailkit Problem: all utility progs (ls, ps, vi) must live inside jail • jailkit project: auto builds files, libs, and dirs needed in jail environment • jk_init : creates jail environment • jk_check: checks jail env for security problems • checks for any modified programs, • checks for world writable directories, etc.
- 
												  Jails and UnikernelsVirtualisation: Jails and Unikernels Advanced Operating Systems Lecture 18 Colin Perkins | https://csperkins.org/ | Copyright © 2017 | This work is licensed under the Creative Commons Attribution-NoDerivatives 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nd/4.0/ or send a letter to Creative Commons, PO Box 1866, Mountain View, CA 94042, USA. Lecture Outline • Alternatives to full virtualisation • Sandboxing processes: • FreeBSD jails and Linux containers • Container management • Unikernels and library operating systems Colin Perkins | https://csperkins.org/ | Copyright © 2017 2 Alternatives to Full Virtualisation • Full operating system virtualisation is expensive • Need to run a hypervisor • Need to run a complete guest operating system instance for each VM • High memory and storage overhead, high CPU load from running multiple OS instances on a single machine – but excellent isolation between VMs • Running multiple services on a single OS instance can offer insufficient isolation • All must share the same OS • A bug in a privileged service can easily compromise entire system – all services running on the host, and any unrelated data • A sandbox is desirable – to isolate multiple services running on a single operating system Colin Perkins | https://csperkins.org/ | Copyright © 2017 3 Sandboxing: Jails and Containers • Concept – lightweight virtualisation • A single operating system kernel • Multiple services /bin • Each service runs within a jail a service specific container, running just what’s needed /bin /dev for that application /dev /etc /etc /mail /home /home /jail /www /lib • Jail restricted to see only a subset of the / /lib /local /usr filesystem and processes of the host /usr /lib /sbin • A sandbox within the operating system /sbin /sbin /tmp /tmp /share /var • Partially virtualised /var Colin Perkins | https://csperkins.org/ | Copyright © 2017 4 Example: FreeBSD Jail subsystem Jails: Confining the omnipotent root.
- 
												  Security Scanning a Jail by BENEDICT REUSCHLING1 of 3 PRACTICAL Security Scanning a Jail BY BENEDICT REUSCHLING This column covers ports and packages for FreeBSD that are useful in some way, peculiar, or otherwise good to know about. Ports extend the base OS functionality and make sure you get something done or, simply, put a smile on your face. Come along for the ride, maybe you’ll find something new. n this installment, I stray a bit from presenting a bouquet of ports and just focus on a single one: security/lynis. Lynis is a tool for security auditing, hardening, and compliance testing. What lynis does dif- ferently from other security scanners is that it tries to detect available components on a sys- Item such as a webserver or database. Once it finds them, it checks them further for vulnerabil- ities, missing patches, etc. This way, the scans are different on each system based on the configured software and purpose. For example, your firewall Available as open host may receive different examinations than your backup server. Plugins extend lynis’s functionality to source as well as a cover specific software. A comprehensive security commercial enterprise report is generated at the end, eliciting either a pat on the back from your security-minded superiors or tool, lynis scans a range your next security sensitivity training. Available as open source as well as a commer- of operating systems for cial enterprise tool, it scans a range of operating systems for security anomalies. Sysadmins as well security anomalies. as penetration testers, developers, and auditors can use it to assess if there are any vulnerabilities, not only in installed software but also in their configurations.
- 
												  Iocell Documentation Release 1.7.3iocell Documentation Release 1.7.3 iocell Nov 23, 2018 Contents 1 Documentation: 3 1.1 Basic usage................................................3 1.2 Networking................................................6 1.3 Jail types.................................................8 1.4 Best practices............................................... 10 1.5 Advanced usage............................................. 11 1.6 How to create and use templates..................................... 15 1.7 Create a jail package!.......................................... 15 1.8 Create a Debian Squeeze jail (GNU/kFreeBSD)............................. 16 1.9 Known Issues............................................... 17 1.10 FAQ.................................................... 18 1.11 Indices and tables............................................ 19 i ii iocell Documentation, Release 1.7.3 iocell is a zero dependency drop in jail/container manager amalgamating some of the best features and technologies FreeBSD operating system has to offer. It is geared for ease of use with a simple and easy to understand command syntax. FEATURES: • Templates, clones, basejails, fully independent jails • Ease of use • Zero configuration files • Rapid thin provisioning within seconds • Automatic package installation • Virtual networking stacks (vnet) • Shared IP based jails (non vnet) • Resource limits (CPU, MEMORY, etc.) • Filesystem quotas and reservations • Dedicated ZFS datasets inside jails • Transparent ZFS snapshot management • Binary updates • Differential jail packaging • Export and import • And many more! Contents 1 iocell Documentation, Release 1.7.3 2 Contents CHAPTER 1 Documentation: 1.1 Basic usage This section is about basic use and is a suitable quick howto for newcomers. 1.1.1 Fetch a release The very first step with iocell is to fetch a RELEASE. By default iocell will attempt to fetch the host’s current RE- LEASE from the freebsd.org servers. Once the RELEASE bits are downloaded, the most recent patches are applied too.
- 
												  AWS Firecracker VMM 之 ⼤熱天捲起袖⼦動⼿玩Playing with AWS Firecracker VMM 之 ⼤熱天捲起袖⼦動⼿玩 ... Ernest Chiang @ COSCUP 2020, Track: Cloud Native Hub Give me a place to stand on, and I will move the Earth. —Archimedes 2 sli.do #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm #awsvmm 議程中有任何問題、好奇、疑問,都可以隨時丟進 sli.do US$25 AWS Credits 問券連結,也放在 sli.do 裡頭喔 3 Ernest Chiang Worked on process integration engineering in semiconductor industry @tsmc. Doing product and technology integration in fitness industry @pafers. Off Work TGO Networks Taipei. AWS Community Hero. Mozillian. AIESECer. 4 Outline Problems & Solutions Firecracker Virtualization & Containerization Lambda & Fargate Firecracker & container d Live Demo Getting started with Firecracker in 2 Minutes Creating 4,000 microVMs in 90 seconds Firecracker & Open Source Projects 8 Problems & Solutions 9 Firecracker, Part 1 10 What is Firecraker Firecracker is an open source VMM that is purpose-built for creating and managing secure, multi-tenant container and function-based services. 11 What is Firecraker Firecracker is an open source VMM that is purpose-built for creating and managing secure, multi-tenant container and function-based services. 12 What problem is AWS helping to solve? 13 What problem is AWS helping to solve? 14 What problem is AWS helping to solve? Multiple functions on multiple environments from multiple accounts . 15 What is Firecracker Open source virtualization technology (microVM) Security and isolation of traditional VMs Speed and density of containers Low resource overhead Developed at Amazon 16 Benefits of Firecracker 17 Benefits of Firecracker 18 安全隔離好 啟動時間短 產能效率⾼ #像極了愛情 -- AWS Firecracker VMM 19 Virtualization & Containerization 20 Virtualization (1/3) In computing, virtualization refers to the act of creating a virtual (rather than actual) version of something , including virtual computer hardware platforms, storage devices, and computer network resources.
- 
												  Bulk Ports Management with Poudriere1 of 3 Bulk Ports Management with Poudriere BY DREW GURKOWSKI Step 1: Using the Ports Collection to Install Poudriere All commands in this section should be run as root; using the su(1) command will work. To download a compressed snapshot of the Ports Collection into /var/db/portsnap: # portsnap fetch When running portsnap for the first time, extract the snapshot into/usr/ports : # portsnap extract Next, build and install Poudriere from the Ports Collection using: # cd /usr/ports/ports-mgmt/poudriere # make install clean Prompts will appear throughout the installation process, stick with the default settings and in- stall the port. Step 2: Configuring Poudriere Poudriere is an extremely powerful tool designed for package production, but it can also be used to bulk manage ports. In order to do so, a few small tweaks will need to be made to the configuration before moving forward. First, copy and move the configuration file to the correct location with: # cd /usr/local/etc # cp poudriere.conf.sample poudriere conf Then use the ee(1) text editor to edit the copied configuration file: # ee poudriere.conf Arrow keys can be used to navigate down to the line: FREEBSD_HOST=_PROTO_://_CHANGE_THIS_ Edit the above line, so it instead looks like this: (backspace to delete text) FREEBSD_HOST=ftp://ftp.freebsd.org Type ESCAPE and then ENTER twice to exit and save changes to the configuration file. FreeBSD Journal • May/June 2020 35 2 of 3 Step 3: Set Up a Poudriere Jail Before continuing, Poudriere will need to fetch and extract its own version of the FreeBSD Ports Collection.
- 
												  2-2 ContainersContainers: Docker and Kubernetes 17.1.2019 Santeri Paavolainen “Operating-system-level virtualization, also known as containerization, refers to an operating system feature in which the kernel allows the existence of multiple isolated user-space instances. Such instances, called containers, partitions, virtual environments (VEs) or jails (FreeBSD jail or chroot jail), may look like real computers from the point of view of programs running in them. A computer program running on an ordinary operating system can see all resources (connected devices, files and folders, network shares, CPU power, quantifiable hardware capabilities) of that computer. However, programs running inside a container can only see the container's contents and devices assigned to the container.” Wikipedia: Operating-system-level virtualization COM-EV Microservice architectures and serverless computing 2019 15.1.2019 2 1. Hardware isolation 2. Virtual machine process isolation 3. 3. Process isolation process 4. Container 2. isolation 4. container 1. guest os virtual machine vm hypervisor hardware hardware COM-EV Microservice architectures and serverless computing 2019 15.1.2019 3 Container technologies - LXC: Linux Containers - OS-level process isolation = cgroups + isolated namespaces - Docker uses LXC - Windows … - Windows Server Containers and Hyper-V Isolation, but … - Reality: Windows and MacOS - Docker actually runs a Linux VM where Docker uses LXC - The fact that containers are run in a separate virtual machine can cause issues with volume mounts and networks!