Google Cloud. | 1

Total Page:16

File Type:pdf, Size:1020Kb

Google Cloud. | 1 google cloud. | 1 how the google cloud works. *this is a study guide that was created from lecture videos and is used to help you gain an understanding of how the google cloud works. Google Cloud Services Foundations GCP- google cloud platform compute for virtualization, storage and other services Compute Services- virtual machines, containers and container management (Docker) and functions and lambdas.serverless computing Storage Services- files, archival storage, persistent disks for virtual machines Data Services- NoSQL, RDBMS, Hadoop and Spark Big Data Services- data pipelines, data warehouses, data science notebooks, machine learning Other services- security and identity, management and monitoring from Stackdriver, developer tools Why Use Google Cloud?- Cloud services allow you to get to market faster. Scalable services save time for developers and DevOps. Automatic usage discounts equal less management of pricing. mind movement machine. google cloud. | 2 Automatic scaling- build app and has as many customers as possible without running out of resources. The default is autoscale until you put a limit. GCP is fast, has frequent improvements, autoscales massively and a good value. GCP has an engineering focus and has documentation challenges and pricing surprises. Gcutil is a tool that is a part of the SDK and is used from the command line. Use gmail account -> use free trial -> have credit card available Working with the console- Google Cloud Console is a browser or web page that gives you the ability to work with cloud services that are available on GCP. Google works best in Chrome. Setting the location, where physically are the virtual machines, databases and etc. This is called data center region. Within GCP, A top level or container is called a project, inside a project you enable services and you set what service should be running. Projects have three associated values: ID, name or number. Find this information on the bottom right on your GCP project page. Set a budget in GCP in the budgets & alerts tab to ensure you do not go over your spend. This is a crucial step when learning how the google cloud works. In GCP, you want to think about making the container for the services. You would create your project, enable API’s and then set up the tools. Most API’s are turned on but some you have to enable manually. mind movement machine. google cloud. | 3 Find API Services- GCP Console -> Home -> left side menu -> compute, storage, stackdriver, tools, big data Pin the menu tab for easy access to services you use. Enabling API’s- GCP Console -> API manager -> look to see which API’s are enabled. -> enable API -> Click the API you want to turn on -> Enable. You adjust API’s at the project level. Google Cloud Security IAM is Identity and Access Management, which is where you can manage user roles and permissions. You can set quotas to disable auto scaling. Adjust IAM- GCP Console -> menu -> IAM & Admin -> ‘compute engine’ is enabled by default -> Add or Edit Permissions of Users in this window -> Set Quotas in the left menu -> Create Service Account in left side menu. Service accounts are for services and IAM is for people or user accounts. GCloud Download GCP SDK -> start gcloud from your terminal and project folder -> authenticate to gcloud -> list or set your project. $ gloud version- command that will tell you the version of the client you have. Verify which version you have before you get started working. $ gcloud auth login- authenticates the user $ gcloud info –show-log – will show the log mind movement machine. google cloud. | 4 $ gcloud config set project (my-project-id> – you can set your projects $ gcloud init <my-project> – you can create a project Make sure you are at the directory where your project is stored locally Gcloud uses two dashes in arguments, copying and pasting can break the code. Set up command line access with gcloud- download mac file -> extract the file -> run the install script -> change directory to script -> run install script -> yes to set path -> set default -> close shell -> reopen shell and run $gcloud init command -> ->Login to Gmail to ensure you are authenticated -> return to terminal and run $gcloud version to see the working version Index into github by GCP for tutorials and code examples. – Google Cloud github GCP projects -> three dot menu on right side-> try an interactive tutorial -> select tutorial -> create a bucket on the google cloud for the tutorial GCP to AWS Reference- GCP vs AWS mind movement machine. google cloud. | 5 Google Cloud Platform Compute Services Virtual Machines, Containers and Functions Compute Services: App Engine, Compute Engine, Container Engine, Networking Google Compute Service is a cloud hosted virtual machine that is an infrastructure as a service. It is enabled by default. You can create a Google Cloud Engine instance using the Google Cloud Engine Console, you can also use gcloud or scripting to regenerate instances in production environments. You can then connect to a Lynx machine and will be able to connect to the SSH, you can then use your virtual machine. For production, you would set other networking options such as security and firewalls GCP Compute Engine -> create instance -> create -> connect via SSH Customize machine type to save budget, you can change the OS to other than Linux Machines. You can set firewall rules to allow HTTP or HTTPS traffic. Scripting, metadata, and preamptibility, which is used for batch processing, are all features of a GCP Compute Instance. Google Cloud Launcher Automation for launching customized virtual machines. These can be virtual machines with applications on the Compute Engine. These can be databases or components that are on preconfigured virtual machines. A deployment is a container for one or more services, used for reproducibility. Deployment manager prints output as deployment is occurring, it will take configuration and allow you to script it. The deployment manager wraps around the Google Compute Engine. mind movement machine. google cloud. | 6 Use Cloud Launcher for setting up an IDE- GCP Home -> cloud launcher -> developer tools -> eclipse Che (open source developer workspace server and cloud IDE) -> launch on compute engine -> deploy -> wait for admin password to populate and copy it -> login to admin panel with admin user and password -> software will run on VM -> you will then have a browser- based IDE to code in Java that is being made on a Google Cloud -> go to compute engine you can see the instance of the VM you are running. Google Compute Engine Instance Groups lets you setup virtual machines for load balancing. Load balancing is explained by letting you put how many instances that will be running and load balancing will monitor and send a health check to ensure you can create new instances with ease. Instance Templates can be used within Instance Groups. Compute and Storage are seperate. You will save money when Compute is turned off for instances you are not using. You can create a snapshot as a basis for a new VM. Metadata is used for tags. Health checks allow you to run health checks against your instances. Zones will explain how many instances you have in the zones, which is used when there are many VM’s you are using. Operations is everything that has happened for activity history. Quota is how much resources you are using. Deployment Manager -> click on deployment -> you can delete or adjust deployment of VM’s within deployment manager. Google Container Engine Application Virtualization is a more efficient method to store, compute and maintain certain mind movement machine. google cloud. | 7 applications. The hypervisor that is typically used for virtualization is replaced with the Docker Engine. This will remove the need for the guest OS to be put in each container. Container Engine is Google’s product and is Google’s implementation of Docker. Kubernetes for container management is used as a way to manage application virtualizations after being setup. Difference between containers and virtual machines- containers only need one operating system for as many containers. Containers still sit on top of virtual machines but they do not deal with a hypervisor to control the instances. Instead you use a container manager, which is the Container Engine for Google Cloud. Containers use caching and allows for quicker application scaling up. Containers are easier to replicate because the setup is in code that you can access. It is easier to use a container to replicate an application than a virtual machine. GCP Console Menu -> container engine -> create a container cluster -> fill out and create container cluster -> to see source code of container: GCP Console menu -> development A Dockerfile is a simple text file where you put the configuration of the container. GCP Console: Container engine, wait for cluster to complete with green check mark -> Access gcloud with button at top right -> (gcloud tool can be faster than authenticating through an installed client because you are logging into gcloud through being logged into your Google Console account) -> clobse sample code and paste into gcloud shell -> get gcloud credentials for clusters -> use docker command to build and push the application and bring down the image -> push ithe image into your cluster with a gcloud space docker push command (by mind movement machine. google cloud. | 8 pushing the image up, that is when application virtualization is happening) -> now that the image is in Kubernetes, use kubernetes control command to run the app from the image -> when you run kubernetes command to expose the deployed image then GCP will assign an external IP address to the container node, so you will be able to look at the service -> you can scale your application that containerized and running on the web -> use kubernetes commands to scale up and see if the scaling works.
Recommended publications
  • Real-Time Backend Architecture Using Node.Js, Express and Google Cloud
    Anh Vu Real-time backend architecture using Node.js, Express and Google Cloud Platform Metropolia University of Applied Sciences Bachelor of Information Technology Information Technology Bachelor’s Thesis 5 January 2021 Abstract Anh Vu Author Real-time backend using NodeJS, Express and Google Cloud Title Platform NumBer of Pages 47 pages Date 5 January 2021 Degree Bachelor of Engineering Degree Programme Information Technology Professional Major MoBile Solutions Instructors Petri Vesikivi, Head of MoBile Solutions Real-time applications, which assure the latency within the defined time limit, are becoming more popular due to the growth of Software as a service trend. Before the evolution of cloud computing, the only solution was to use native WebSockets which are difficult to set up and develop. Recently, Google Cloud Platform provides a developer-friendly, fast and responsive platform to make the process of developing real-time applications seamless. The purpose of the thesis was to demonstrate and Build a scalaBle, high-available and reliaBle Backend architecture using Node.js and Google Cloud Platform. The thesis consists of a theoretical background including Node.js, monolithic and microservices architecture, serverless architecture and real-time dataBase, which provide basic understanding of different architectures and technical solutions. The advantages and disadvantages of the architecture were also clearly analyzed and evaluated. Furthermore, a minimum viable product for a taxi booking app was created to demonstrate the architecture usage in a real use case. To summarize, the thesis aimed to provide the insights of real-time Backend architecture using Node.js and Google Cloud Platform. Moreover, the benefits of using this technology stacks were carefully examined in a case study.
    [Show full text]
  • Google-Cloud Documentation Release 0.20.0
    google-cloud Documentation Release 0.20.0 Google Cloud Platform October 06, 2016 google-cloud 1 Base Client 1 2 Credentials Helpers 5 3 Base Connections 9 4 Exceptions 13 5 Environment Variables 17 6 Configuration 19 6.1 Overview................................................. 19 6.2 Authentication.............................................. 19 7 Authentication 21 7.1 Overview................................................. 21 7.2 Client-Provided Authentication..................................... 21 7.3 Explicit Credentials........................................... 22 7.4 Troubleshooting............................................. 23 7.5 Advanced Customization......................................... 24 8 Long-Running Operations 27 9 Datastore Client 29 9.1 Connection................................................ 32 10 Entities 37 11 Keys 39 12 Queries 43 13 Transactions 47 14 Batches 51 15 Helpers 55 16 Storage Client 57 16.1 Connection................................................ 59 i 17 Blobs / Objects 61 18 Buckets 69 19 ACL 77 20 Batches 81 21 Using the API 83 21.1 Authentication / Configuration...................................... 83 21.2 Manage topics for a project....................................... 83 21.3 Publish messages to a topic....................................... 84 21.4 Manage subscriptions to topics..................................... 84 21.5 Pull messages from a subscription.................................... 86 22 Pub/Sub Client 87 22.1 Connection................................................ 88
    [Show full text]
  • Google Certified Professional - Cloud Architect.Exam.57Q
    Google Certified Professional - Cloud Architect.exam.57q Number : GoogleCloudArchitect Passing Score : 800 Time Limit : 120 min https://www.gratisexam.com/ Google Certified Professional – Cloud Architect (English) https://www.gratisexam.com/ Testlet 1 Company Overview Mountkirk Games makes online, session-based, multiplayer games for the most popular mobile platforms. Company Background Mountkirk Games builds all of their games with some server-side integration, and has historically used cloud providers to lease physical servers. A few of their games were more popular than expected, and they had problems scaling their application servers, MySQL databases, and analytics tools. Mountkirk’s current model is to write game statistics to files and send them through an ETL tool that loads them into a centralized MySQL database for reporting. Solution Concept Mountkirk Gamesis building a new game, which they expect to be very popular. They plan to deploy the game’s backend on Google Compute Engine so they can capture streaming metrics, run intensive analytics, and take advantage of its autoscaling server environment and integrate with a managed NoSQL database. Technical Requirements Requirements for Game Backend Platform 1. Dynamically scale up or down based on game activity 2. Connect to a managed NoSQL database service 3. Run customize Linux distro Requirements for Game Analytics Platform 1. Dynamically scale up or down based on game activity 2. Process incoming data on the fly directly from the game servers 3. Process data that arrives late because of slow mobile networks 4. Allow SQL queries to access at least 10 TB of historical data 5. Process files that are regularly uploaded by users’ mobile devices 6.
    [Show full text]
  • Studi Perbandingan Layanan Cloud Computing
    Jurnal Rekayasa Elektrika Vol. 10, No. 4, Oktober 2013 193 Studi Perbandingan Layanan Cloud Computing Afdhal Jurusan Teknik Elektro, Fakultas Teknik, Universitas Syiah Kuala Jl. Tgk. Syech Abdurrauf No. 7 Darussalam, Banda Aceh 23111 e-mail: [email protected] Abstrak—Selama beberapa tahun terakhir, cloud computing telah menjadi topik dominan dalam bidang teknologi informasi dan komunikasi (TIK). Saat ini, cloud computing menyediakan berbagai jenis layanan, antara lain layanan hardware, infrastruktur, platform, dan berbagai jenis aplikasi. Cloud computing telah menjadi solusi dan pelayanan, baik untuk meningkatkan kehandalan, mengurangi biaya komputasi, sampai dengan memberikan peluang yang cukup besar bagi dunia industri TIK untuk mendapatkan keuntungan lebih dari teknologi ini. Namun disisi lain, pengguna akhir (end user) dari layanan ini, tidak pernah mengetahui atau memiliki pengetahuan tentang lokasi fisik dan sistem konfigurasi dari penyedia layanan ini. Tujuan artikel ini adalah untuk menyajikan sebuah pemahaman yang lebih baik tentang klasifikasi-klasifikasi cloud computing, khususnya difokuskan pada delivery service (pengiriman layanan) sebagai model bisnis cloud computing. Artikel ini membahas model-model pengembangan, korelasi dan ketergantungan antara satu model layanan dengan model layanan yang lainnya. Artikel ini juga menyajikan perbandingan dan perbedaan tingkatan-tingkatan dari model delivery service yang dimulai dari mengidentifikasi permasalahan, model pengembangan, dan arah masa depan cloud computing. Pemahaman tentang klasifikasi delivery service dan isu-isu seputar cloud computing akan melengkapi pengetahuan penggunanya untuk menentukan keputusan dalam memilih model bisnis mana yang akan diadopsi dengan aman dan nyaman. Pada bagian akhir artikel ini dipaparkan beberapa rekomendasi yang ditujukan baik untuk penyedia layanan maupun untuk pengguna akhir cloud computing. Kata kunci: cloud computing, delivery service, IaaS, PaaS, SaaS Abstract—In the past few years, cloud computing has became a dominant topic in the IT area.
    [Show full text]
  • Every App in the Universe
    THE BIGGER BOOK OF APPS Resource Guide to (Almost) Every App in the Universe by Beth Ziesenis Your Nerdy Best Friend The Bigger Book of Apps Resource Guide Copyright @2020 Beth Ziesenis All rights reserved. No part of this publication may be reproduced, distributed, or trans- mitted in any form or by any means, including photocopying, recording or other elec- tronic or mechanical methods, without the prior written permission of the publisher, except in the case of brief quotations embodied in critical reviews and certain other non- commercial uses permitted by copyright law. For permission requests, write to the pub- lisher at the address below. Special discounts are available on quantity purchases by corporations, associations and others. For details, contact the publisher at the address below. Library of Congress Control Number: ISBN: Printed in the United States of America Avenue Z, Inc. 11205 Lebanon Road #212 Mt. Juliet, TN 37122 yournerdybestfriend.com Organization Manage Lists Manage Schedules Organize and Store Files Keep Track of Ideas: Solo Edition Create a Mind Map Organize and Store Photos and Video Scan Your Old Photos Get Your Affairs in Order Manage Lists BZ Reminder Pocket Lists Reminder Tool with Missed Call Alerts NerdHerd Favorite Simple To-Do List bzreminder.com pocketlists.com Microsoft To Do Todoist The App that Is Eating Award-Winning My Manager’s Favorite Productivity Tool Wunderlist todoist.com todo.microsoft.com Wunderlist Plan The Award-Winning Task Manager with a Task Manager and Planning Tool Rabid Fanbase
    [Show full text]
  • Choosing a Database- As-A-Service an Overview of Offerings by Major Public Cloud Service Providers
    CHOOSING A DATABASE- AS-A-SERVICE AN OVERVIEW OF OFFERINGS BY MAJOR PUBLIC CLOUD SERVICE PROVIDERS Warner Chaves Principal Consultant, Microsoft Certified Master, Microsoft MVP With Contributors Danil Zburivsky, Director of Big Data and Data Science Vladimir Stoyak, Principal Consultant for Big Data, Certified Google Cloud Platform Qualified Developer Derek Downey, Practice Advocate, OpenSource Databases Manoj Kukreja, Big Data and IT Security Specialist, CISSP, CCAH and OCP When it comes to running your data in the public cloud, there is a range of Database-as-a-Service (DBaaS) offerings from all three major public cloud providers. Knowing which is best for your use case can be challenging. This paper provides a high-level overview of the main DBaaS offerings from Amazon, Microsoft, and Google. After reading this white paper, you’ll have a high-level understanding of the most popular data repositories and data analytics service offerings from each vendor, you’ll know the key differences among the offers, and which ones are best for each use case. With this information, you can direct your more detailed research to a manageable number of options. www.pythian.com | White Paper 1 This white paper does not discuss private cloud providers or colocation environments, streaming, data orchestration, or Infrastructure-as-a-Service (IaaS) offerings. This paper is targeted to IT professionals with a good understanding of databases and also business people who want an overview of data platforms in the cloud. WHAT IS A DBAAS OFFERING? A DBaaS is a database running in the public cloud. Three things define a DBaaS: • The service provider installs and maintains the database software, including backups and other common database administration tasks.
    [Show full text]
  • Announcement
    Announcement 68 articles, 2016-03-23 18:03 1 Android Pay is coming to the UK, Google’s first market in Europe (2.00/3) Now signing up 1.5 million users each week in the U. S., Android Pay, Google's payment platform and mobile wallet for Android devices and apps, is coming to.. 2016-03-23 18:02 2KB feedproxy.google.com 2 Bill Gates calls for tighter safeguards on bulk data collection Gates discusses the iPhone case, AI and the Microsoft Surface Book in his latest Reddit chat,Business Software,Privacy ,Microsoft,Bill Gates,Windows 10,Artificial Intelligence 2016-03-23 18:02 2KB www.computing.co.uk 3 Opera to add native ad blocker to desktop web browser Need for speed? Opera's web browser will come with a built-in speedometer ,Software,Internet,Internet of Things,Cloud and Infrastructure ,Opera,Opera Software,Browsers 2016-03-23 18:02 2KB www.computing.co.uk 4 US Department of Defense to migrate millions of PCs to Windows 10 - in just one year DoD to migrate four million desktops in just one year - or, at least, that's the plan,Operating Systems,Software ,Windows,Windows 10,Microsoft,Department of Defense,operating system 2016-03-23 18:02 3KB www.computing.co.uk 5 AI is coming, so we'd better start preparing for it Francois Tung of law firm Pinsent Masons examines the legal implications of rapidly improving artificial intelligence,Cloud and Infrastructure,Software,Legislation and Regulation ,Artificial Intelligence,Machine Learning,Pinsent Masons 2016-03-23 18:02 1017Bytes www.computing.co.uk 6 Security flaw affecting thousands of Linux apps
    [Show full text]
  • GAE — Google App Engine
    GAE — Google App Engine Prof. Dr. Marcel Graf TSM-ClComp-EN Cloud Computing (C) 2017 HEIG-VD Google App Engine Introduction ■ Google App Engine is a PaaS for building scalable web applications and mobile backends. ■ Makes it easy to deploy a web application: ■ The client (developer) supplies the application’s program code. ■ Google’s platform is responsible for running it on its servers and scaling it automatically. ■ The platform offers built-in services and APIs such as NoSQL datastore, memcache and user authentication. ■ There are some restrictions regarding the supported application types. ■ The application cannot access everything on the server. ■ The application must limit its processing time. ■ Launched in April 2008 (preview) ■ In production since September 2011 ■ Supported programming languages and runtimes: ■ Go ■ Node.js ■ PHP ■ .NET ■ Java ■ Ruby ■ Python ■ … TSM-ClComp-EN Cloud Computing | Google App Engine | Academic year 2017/2018 2 Platform as a Service Reminder: Web applications in Java — Servlets ■ A servlet runs inside an application server ■ The server calls the servlet to handle an HTTP request ■ The task of the servlet is to create the response Application server HTTP request Servlet HTTP response Browser Servlet Database Servlet TSM-ClComp-EN Cloud Computing | Google App Engine | Academic year 2017/2018 3 Platform as a Service Reminder: Web applications in Java — Servlets ■ Complete picture of the processing of an HTTP request Application Servlet servier Browser Database HTTP request HTTPServlet Request object URL
    [Show full text]
  • Create a Created Date Schema in Bigquery
    Create A Created Date Schema In Bigquery Denominative Elmore canoodles funereally. Auxiliary Rudy acquiesce: he unbar his aril adequately and contagiously. Overreaching Randolf excused some scrabble and shacks his syneresis so midnight! Wait polls with exponential backoff. The schema in tests are many cases. Perform various levels of dates in schema definition file somewhere on create date column when you created in an identical data will lead at. You'd may that avoiding two scans of SALES would improve performance However remember most circumstances the draw BY version takes about twice as suite as the lot BY version. You run some SQL queries against that data. The script then creates a partition function, if configured. The order in which clustering is done matters. Developer relations lead at SKB Kontur. We create schema in the dates and their very wide variety of bigquery operation creates a working directory as a few seconds of model you. By using the client libraries. Database services to migrate, please leave it empty. What are partitions in SQL? In our case the join condition is matching dates from the pedestrian table and the bike table. The mother source records need is have a machine image updates to merge correctly. Python community version selection works in schema of bigquery operation on create date column can be created in a new custom schema tab. Once per store the data needs to simplify etl to any data are also a bigquery? Google Analytics 360 users that have that up the automatic BigQuery export will rejoice by this. Whether to automatically infer options and schema for CSV and JSON sources.
    [Show full text]
  • Google Genomics Documentation Release V1
    Google Genomics Documentation Release v1 Google Genomics <https://cloud.google.com/genomics> Mar 07, 2018 Contents 1 Select Genomic Data to work with3 1.1 Discover Published Data.........................................3 2 Process Data on Google Cloud 19 2.1 Run workflows and common tasks in parallel.............................. 19 2.2 Run Galaxy on Compute Engine..................................... 43 2.3 Run NCBI BLAST on Compute Engine................................. 43 2.4 Run Bioconductor on Compute Engine................................. 43 2.5 Run iPython Notebooks on Compute Engine.............................. 45 3 Access Genomic Data using. 47 3.1 Integrative Genomics Viewer (IGV)................................... 47 3.2 Run Picard and GATK tools on Cloud-Resident Genomic Data..................... 49 3.3 Browse Reads with Bioconductor.................................... 54 3.4 Beacon.................................................. 55 3.5 GABrowse................................................ 55 3.6 The R client............................................... 56 3.7 Python.................................................. 56 3.8 Java.................................................... 57 3.9 Go.................................................... 57 4 Analyze Data in Google Genomics 59 4.1 Analyze Reads.............................................. 59 4.2 Analyze Variants............................................. 67 4.3 Annotate Variants............................................ 101 4.4 Perform Quality Control Checks....................................
    [Show full text]
  • Overview Objectives Exploring the Bigquery Console
    CS6502 - APPLIED BIG DATA AND VISUALIZATION (Spring 2020) Lab 3 Overview SQL (Structured Query Language) is a standard language for data operations that allows you to ask questions and get insights from structured datasets. It's commonly used in database management and allows you to perform tasks like transaction record writing into relational databases and petabyte-scale data analysis. This lab serves as an introduction to SQL and is intended to prepare you for the many labs in this course. This lab is divided into two parts: in the first half, you will learn fundamental SQL querying keywords, which you will run in the BigQuery console on a public dataset that contains information on London bikeshares. In the second half, you will learn how to export subsets of the London bikeshare dataset into CSV files, which you will then upload to Cloud SQL. From there you will learn how to use Cloud SQL to create and manage databases and tables. Towards the end, you will get hands-on practice with additional SQL keywords that manipulate and edit data. Objectives In this lab, you will learn how to: ● Distinguish databases from tables and projects. ● Use the SELECT, FROM, and WHERE keywords to construct simple queries. ● Identify the different components and hierarchies within the BigQuery console. ● Load databases and tables into BigQuery. ● Execute simple queries on tables. ● Learn about the COUNT, GROUP BY, AS, and ORDER BY keywords. ● Execute and chain the above commands to pull meaningful data from datasets. ● Export a subset of data into a CSV file and store that file into a new Cloud Storage bucket.
    [Show full text]
  • An Analytic Real Life Case Study of Google Cloud Computing Architecture Platform’S for Big Data Analytics Products from AWS, Azure and GCP
    Vol-6 Issue-5 2020 IJARIIE-ISSN(O)-2395-4396 An Analytic Real Life Case Study of Google Cloud computing architecture Platform’s for Big Data Analytics Products from AWS, Azure and GCP Devendra Kumar Nagayach1, Dr. Pushpneel Verma2 1Research Scholar, Deptt. of Computer Science & Application, Bhagwant University, Rajasthan, India 2Associate Professor, Bhagwant University, Rajasthan, India ABSTRACT A large amount of data, structured or unstructured, is called "Big Data", which mainly consists of five properties such as volume, velocity and variation, validation and value of which value is the most important whose main purpose is to extract relevant information. The other four V’s to solve this problem of polite data and analysis; various technologies have emerged in the last decades. "Cloud computing" is a platform where thousands of servers work together to meet various computing needs and billing is done as per 'pay-as-you-go' increases. This study will present a novel dynamic scaling methodology to improve the performance of big data systems. A dynamic scaling methodology will be developed to scale the system from a big data perspective. Furthermore, these aspects will be used by the algorithm of the supporting project, which can be broken down into smaller tasks to be processed by the system. These small bangles will be run on multiple virtual machines to perform parallel work to increase the runtime performance of the system. Keyword: - SCC, GFRSCC, Properties, and EFNARC etc. 1. INTRODUCTION Currently, we live in an era where big data has emerged and is attracting attention in many fields such as science, healthcare, business, finance and society.
    [Show full text]