Automating Pipelines with Mixed-Semantics Data Sources


Koji: Automating pipelines with mixed-semantics data sources

Petar Maymounkov
Aljabr, Inc.
[email protected]

ABSTRACT

We propose a new result-oriented semantic for defining data-processing workflows that manipulate data in different semantic forms (files or services) in a unified manner. This approach enables users to define workflows for a vast variety of reproducible data-processing tasks in a simple declarative manner which focuses on application-level results, while automating all control-plane considerations (like failure recovery without loss of progress and computation reuse) behind the scenes.

The uniform treatment of files and services as data enables easy integration with existing data sources (e.g. data-acquisition APIs) and sinks of data (e.g. database services), whereas the focus on containers as transformations enables reuse of existing data-processing systems.

We describe a declarative configuration mechanism, which can be viewed as an intermediate representation (IR) of reproducible data-processing pipelines, in the same spirit as, for instance, TensorFlow [12] and ONNX [16] utilize IRs for defining tensor-processing pipelines.

ACM Reference format:
Petar Maymounkov. 2018. Koji: Automating pipelines with mixed-semantics data sources. In Proceedings of White paper, Brooklyn, NY, 2018, 11 pages. DOI: 10.1145/nnnnnnn.nnnnnnn

arXiv:1901.01908v1 [cs.DC] 2 Dec 2018

1 INTRODUCTION

1.1 History

The introduction of MapReduce [4] by Google arguably marked the beginning of programmable large-scale data processing. MapReduce performs a transformation of one set of large files (the input) into another (the output). Since the transformation provided by a MapReduce is a primitive (a many-to-many shuffle, followed by an element-wise map), it became common practice to chain multiple MapReduce transformations in a pipeline. The dataflow in such a pipeline is cleanly captured by a directed acyclic graph (DAG), whose vertices represent transformations and edges represent files.

In a twist, it became commonplace to query a service (usually a key-value lookup service) from inside the mapper function. For instance, this technique is used to join two tables by mapping over one of them and looking up into the other. More recently, Machine Learning systems have been serving trained models as lookup services, which are used by MapReduce mappers in a similar fashion.

With this twist, a MapReduce transformation no longer depends just on input files but also on lookup services (and their transitive dependencies, which are usually other files). The simple dataflow model mentioned previously no longer applies.

To the best of our knowledge, no dataflow model has been proposed to capture this scenario. Yet, to this day, this type of mixed-semantics (file and service) pipeline represents the most common type of off-line batch-processing workflow.

Due to the lack of a specialized formalism for describing such pipelines and a tool for executing them, they are currently codified in a variety of error-prone ways, which usually amount to the use of data-unaware task-execution pipelines. We address this gap here.

1.2 Problem

We address a class of modern Machine Learning and data-processing pipelines.

Such pipelines transform a set of input files through chains of transformations, provided by open-source software (OSS) for large-scale computation (like TensorFlow [12], Apache Beam [6], Apache Spark [8], etc.) or by user-specific implementations (usually provided as executable containers). In short, these pipelines use mixtures of disparate OSS technologies tied into a single coherent dataflow.

At present, such workflows are frequently built using task-driven pipeline technologies (like Apache Airflow [5]), which execute tasks in a given dependency order but are unaware of the data passed from one task to the next. The lack of dataflow awareness in current solutions prevents large data-processing pipelines from benefiting from caching and reuse of computation, which could provide significant efficiency gains in these industry cases:

• Restarting long-running pipelines after failure and continuing from where previous executions left off.
• Re-running pipelines with incremental changes, during developer iterations.
• Running concurrent pipelines which share logic, i.e. compute identical data artifacts within their workflows.
Furthermore, task-centric technologies make it impractically hard to integrate data and computation optimizations like:

• In-memory storage of intermediate results which are not cache-able, or
• Context-specific choices of job-scheduling and placement algorithms.

1.3 Solution

We propose a new pipeline semantic (and describe its system architecture, which can be realized on top of Kubernetes) based on a few key design choices:

• Result-oriented specification: The goal of workflows is to build data artifacts. Workflows are represented as dependency graphs over artifacts. Input artifacts are provided by the caller. Intermediate artifacts are produced through a computation, using prior artifacts. Output artifacts are returned to the caller. This view is entirely analogous to the way software build systems (like Bazel [11] and UNIX make) define software build dependencies.

• Unified treatment of data and services: We view file artifacts and service artifacts in a unified way, as resources. This allows us to describe complex workflows which mix-and-match batch and streaming computations (the latter being a special case of services). Furthermore, this enables us to automate service garbage-collection and achieve optimal computation reuse (via caching) across the entire pipeline. The resource-level unified view of files and services purports to be the Goldilocks level of coarse data knowledge that is needed by a dataflow controller to automate all file-caching and service-control considerations.

• Type-safe declarative specification: We believe that workflow specification has to be declarative, i.e. representable via a typed schema (like, e.g., Protocol Buffers). This provides full decoupling from implementations, and serves as a reproducible assembly language for defining pipelines.

• Decoupling of dataflow from transform implementation: We decouple the specification of application logic from the definition of how data transforms are performed by underlying backend technologies. Application logic comprises the dependency graph between artifacts and the data transform at each node. Data transforms are viewed uniformly, akin to functions from a library of choices. The methods for invoking transformation-backing technologies (like MapReduce, TensorFlow, etc.) are implemented separately as driver functions, and surfaced as a library of declarative structures that can be used in application logic (a sketch of this split appears at the end of this section).

• Extensible transformations: New types of data transforms (other than container-execution based) can be added easily. This is done in two parts. First, a simple driver function implements the specifics of calling the underlying technology. Second, a new transformation structure is added to the application-logic schema. This extension mechanism is reserved for transformations that cannot be containerized.

• Scheduler- and storage-independent design: Application logic governs the order in which data computations must occur. However, the choice of job-scheduling (or placement) algorithms, as well as the choice of storage technologies (e.g. disk versus memory volumes), is entirely orthogonal to the application's dataflow definition. Our architecture enables flexible choice of the relevant technology on a per-node (scheduling) and per-edge (storage) basis. For instance, some intermediate files can be stored in memory volumes, instead of on disk, to increase efficiency.

Kubernetes [13], which is becoming the industry-standard cluster OS, is ubiquitously available on most cloud providers, can be run on-premises, and is generally provider-agnostic from the users' standpoint. Kubernetes benefits from having mature integrations with the Docker (and other) container ecosystems, providing seamless out-of-box access to many data tools, thanks to the rich ecosystem of operator implementations [14].
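To make the split between application logic and driver functions concrete, here is a minimal sketch in Go (one of the two languages the paper names for building pipeline dataflows). All names in it (Resource, Transform, Driver, RegisterDriver) are hypothetical illustrations of the design choice, not the system's actual API.

```go
// Hypothetical illustration of the dataflow/driver split described above;
// none of these names are taken from the actual system.
package koji

import "fmt"

// Resource is the unified view of a file or a service.
type Resource struct {
	Name      string
	IsService bool
	Format    string // file format or service protocol; empty if unspecified
}

// Transform is the declarative structure used by application logic: it
// names its inputs and outputs and carries backend-specific parameters,
// but says nothing about how the backend is invoked.
type Transform struct {
	Kind    string // e.g. "container", "tensorflow", "mapreduce"
	Inputs  map[string]Resource
	Outputs map[string]Resource
	Params  map[string]string
}

// Driver implements the specifics of invoking one backing technology.
// Supporting a new transform type means writing one Driver and adding a
// matching declarative structure to the application-logic schema.
type Driver interface {
	Kind() string
	Run(t Transform) error
}

// registry maps transform kinds to drivers, so a controller can execute
// any node of the dataflow graph without knowing backend details.
var registry = map[string]Driver{}

func RegisterDriver(d Driver) { registry[d.Kind()] = d }

func Execute(t Transform) error {
	d, ok := registry[t.Kind]
	if !ok {
		return fmt.Errorf("no driver for transform kind %q", t.Kind)
	}
	return d.Run(t)
}
```

Under this split, the application's dataflow program only ever constructs Transform values; scheduling, placement, and storage decisions stay on the controller and driver side, which is what makes them swappable per node and per edge.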
1.4 How it works

The user defines a pipeline in a language-agnostic manner. A pipeline definition describes: (i) input data resources and their sources, (ii) intermediate resources and the data transformation that produced them from dependent resources, and (iii) output data resources and where they should be delivered.

• Resources are files (or services), and their format (or protocol) can be optionally specified to benefit from type-safety checks over the dataflow graph, thanks to the declarations.
• Input resources can be provided by various standard methods (volume files, existing cluster services, Amazon S3 buckets, and so on). Data-source types can be added seamlessly.
• Intermediate resources are described as files (or services), optionally with a specified format (or protocol). Their placement location is not provided by the user, in order to enable the pipeline controller to make optimal choices in this regard and to manage caching-placement decisions.
• Output resources can specify the location where they should be delivered, with standard (and extensible) options, similarly to the case of input resources.

The transformations at the nodes of the dataflow graph consume a set of input resources to produce new resources. Transformations are exposed to the user with a clean application-level interface. A transformation:

• Consumes a set of named input resources (the arguments), which can be fulfilled by prior resources in the user's dataflow program.
• Produces a set of named output resources. The latter can be referenced (by name) by dependent transformations, downstream in the dataflow program.
• Accepts a transformation-specific set of parameters. For instance, a TensorFlow transformation may require a TensorFlow program (e.g. as a Python or Protocol Buffer file).

The user programs (in Python or Go) which build the pipeline dataflow are used to generate a Protocol Buffer (or YAML) file, which captures the entire pipeline and is effectively executable and reproducible, as sketched below. Pipelines can be executed either from the command line or by sending them off as a background job to Kubernetes, using the operator pattern (via a CRD).
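As a concrete (and again hypothetical) illustration of such a user program, the following self-contained Go sketch wires two transformations into a small dependency graph (raw → features → model) and serializes the pipeline definition. All schema names here are invented, and JSON via the standard library stands in for the Protocol Buffer or YAML output the paper describes.

```go
package main

import (
	"encoding/json"
	"fmt"
	"os"
)

// Minimal, hypothetical pipeline schema: resources are files or services;
// each transform names its inputs, outputs, and backend-specific params.
type Resource struct {
	Name    string `json:"name"`
	Kind    string `json:"kind"`              // "file" or "service"
	Format  string `json:"format,omitempty"`  // file format or service protocol
	Source  string `json:"source,omitempty"`  // inputs: S3 bucket, volume, cluster service
	Deliver string `json:"deliver,omitempty"` // outputs: delivery location
}

type Transform struct {
	Kind    string            `json:"kind"` // e.g. "container", "tensorflow"
	Inputs  []string          `json:"inputs"`
	Outputs []string          `json:"outputs"`
	Params  map[string]string `json:"params,omitempty"`
}

type Pipeline struct {
	Resources  []Resource  `json:"resources"`
	Transforms []Transform `json:"transforms"`
}

func main() {
	p := Pipeline{
		Resources: []Resource{
			// Input: a file with a declared source and format.
			{Name: "raw", Kind: "file", Format: "csv", Source: "s3://bucket/raw.csv"},
			// Intermediate: no placement given; the controller decides where
			// it lives (disk or memory volume) and whether a cached copy exists.
			{Name: "features", Kind: "file", Format: "tfrecord"},
			// Output: delivered to a caller-specified location.
			{Name: "model", Kind: "file", Deliver: "s3://bucket/model"},
		},
		Transforms: []Transform{
			{Kind: "container", Inputs: []string{"raw"}, Outputs: []string{"features"},
				Params: map[string]string{"image": "example.com/featurize:v1"}},
			{Kind: "tensorflow", Inputs: []string{"features"}, Outputs: []string{"model"},
				Params: map[string]string{"program": "train.py"}},
		},
	}
	// The real system would emit a Protocol Buffer or YAML file; JSON stands in here.
	enc := json.NewEncoder(os.Stdout)
	enc.SetIndent("", "  ")
	if err := enc.Encode(&p); err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
}
```

Because the serialized definition names artifacts rather than tasks, a controller replaying it can skip any transform whose outputs are already cached, which is exactly the failure-recovery and computation-reuse behavior motivated in Section 1.2.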