BeData data in 10 professions

1 INTRODUCTION

There are more and more data professions. A video and a glossary complement this comic. These medium make it possible Given that the amount of data collected by companies has increased consi- to materialise the data value chain: like an industrial chain, it is evident that at derably over the past 20 years, data professions are more numerous, varied, each step of data processing a different profession intervenes. It is important to but also promising. To understand this constantly changing world, let’s explore specify that depending on the size of the company, its geographical footprint 10 key professions in a «DATA CENTRIC» organisation. This comic was ( / International), its sector / industry, its organisation (rather centralised / designed as part of the classification, and clarification work of data professions decentralised), some employees may “combine” roles or, on the contrary, many by the Data commission of Syntec Conseil. Its aim is to serve as benchmarks. other more specific roles exist (for example in banking, Data Marketing or even within ISDs). Enjoy your reading!

2 3

Our student apprentice is about to arrive. I’ll greet her. Let’s go ! I’m going to impress her! She will be speechless after my presentation!

Come on, I’m going to be clear and precise while remaining cool!

Good presentation Who’s this? is part of the This is our young and basics. 4 brilliant student apprentice ! 5 Welcome !

You wanted to discover data professions? Good, good!

Pay close attention ! Open your eyes and listen carefully!

And be careful to where you set foot. Well, here is our first My dear, Of all of us, Like Not bad... Do data hero… let me introduce you he is probably the one C++, SQL? you know of any to our… who speaks the most different languages! others?

… Without R, PYTHON… … MATLAB… forgetting Julia!

6 7

He develops Hmm! Okay, He processes raw data He is above all a the infrastructure defined I think we collected both internally and computer specialist! with the data architect. got it! She knows externally. a thing or two...

Let’s proceed!

He ensures And makes it evolve Then he has to speak the maintenance… according to the latest a lot of languages… technologies and new The Data Engineer develops the infrastructure defined by / with the Data Architect. He builds robust technical solutions (via robustness tests) and reliable. He ensures their maintenance and upgrades in accordance with safety constraints. solutions. Indeed! He integrates data of various kinds that come from these multiple sources, supervises them and checks the quality of the The data data. In production, he ensures the follow-up and monitoring of data flows / interfaces. He also ensures that his work is sufficiently documented (programs, interfaces, inputs / outputs, formats, etc.). mechanic Levels & Types of Training Training New The Data Engineer is above all a computer specialist. • Pro Expert Masters in Computer Science, technologies He must master a certain number of languages, technologies Business Intelligence and Big Data option (Lyon) and and methods: Python, SQL, ETL and its “modern” versions • School 42, Jedha Bootcamp… NoSQL (Hive, Impala, Spark SQL) and Hadoop for the Big • Specialised Master in Big Data (Télécom ParisTech, solutions Data part, the Cloud, DevOps and CRISP methods. EM Grenoble, Essec, etc.) • A level +5 years of study compulsory • Master in Computer Science, specialising in Computer • Specialised engineering school Science and Business Intelligence Exploration • Google Cloud Certified Certification - Professional Data Engineer A computer specialist I am the data controller, I present to you She is the quality manager…… But I’m the... more than that!

Yes, that’s right, the data steward is extremely And she knowledgeable about data professions and all has some the data that relates to them. authority!

8 9

I would even OK, OK, thank you! See you It is often a say that I play a Come on, let’s soon! senior profile Hmm. I mean Otherwise key role… move on! this position requires a we’ll never be able lot of experience to stop her... and many years of practice...

… In the data governance project!!!

She not only ensures that the data is ... But also that present and compliant... it is relevant and understood by all. The data quality and business The role of the Data Steward is to ensure that the data is relevant, present, compliant, consistent, understood. controller It translates the business rules relating to the quality of data into requests (allowing their regular verification), he defines the quality indicators and the corresponding tolerance thresholds. The Data Steward is the referent in a data governance project. He plays a key role in its realisation, in particular because he acquires the knowledge of the business of data and Specialist their metadata. The Data Steward is a senior person with some authority in the organisation who has agreed to be the person responsible for the quality of a defined dataset, for example, the CFO for financial data.

Quality Manager … And also cyber safety And your Data Architect, Come in here can he also carry out an Look, he’s our... constraints. now... extension?

… That’s right. Hmm... If the system He takes charge of that. needs a change... (And he’s a champion for extinguishing the fires on the schedule!)

10 11

Once the foundations have been He optimises collection and He offers models that respond But is he alone to do No, he works with other data Let’s continue our visit, laid, he organises the recovery storage infrastructure. to business challenges. all this? experts! you haven’t seen anything of raw data and their yet... management.

He also takes into account He can work on the data regulatory changes... dictionary... … That Le Data Architect intervenes upstream of the data processing to organise the recovery and management of raw data, Hold this, my more or less structured, in more or less large quantity and coming from various sources (internal, external). After the data is here! The data inventory, he defines and optimises the collection, storage, handling infrastructure and the associated flows. He proposes friend! systems modelling changes to meet the challenges of the business lines and facilitate the cross-referencing of downstream data. He may have to work on the data dictionary, the design of the CDM (Conceptual Data Model) or the inventory of the frame architect of reference in place. The role is more and more fundamental in a context of cloud architecture, open, real time and cyber Responds safety and regulatory constraints. to business challenges Levels & types of training • A Level + 5 years of study in IT, Management, Statistics • Big Data training • Specialised engineering school • Mastery of Business Intelligence and necessary experience in this field (a double course in business strategy is a plus) OpenClassRomms offers a work-study Data Architect training programme, and has partnered with Centrale Supélec to create an online Data Architect training course. The architect If you don’t mind Here is the... yes !!! She is versatile coming in, here is our next acquaintance.

In short, she extracts And agile! the substantive marrow With whom? with brio!

12 13

As you can see, she combines I have the unpleasant mathematical, statistical and computer And she is able to feeling that they are laughing But let her tools. code them… Yes, yes! at me... work...

Impressive!

The Data Scientist processes, analyses and enhances the data of a company in order to define the best development strategy: She even Mathematics and marketing and sales strategy, improvement of performance and profitability, forecasting... With knowledge of mathematical / statistical builds intelligent and IT tools, he is able to code them (R, Python), to produce methods (automated, as much as possible) for sorting and analysing NO ??!! computer science hold mass data and more or less complex or disjointed sources, and to build «intelligent» algorithms, in order to extract useful information. algorithms!!! no secrets for him in The missions of the Data Scientist are different order to extract the (very often in connection with the Data Engineer and the Data Architect): substantive marrow Data Science training • Explore new sources of data to broaden the ability to identify Engineering degree with Big Data / Data Science specialisation: from the data business and operational efficiency issues more precisely and • IAMD - Engineer and Applications of Big Data (Télécom Nancy) quickly • Master 2 Mathematics and Applications: Data Science course Advanced • Take advantage of cutting-edge technologies to obtain better (École Polytechnique) technologies data analysis and design (predictive) models • Big Data & Data Science (Mine Nancy) • Data Science (Ensae ParisTech) • Participate in the industrialisation of these models • Bachelor in Data Science (FHNW, University of Applied Sciences • Combine structured and unstructured data analysis methods and Northwestern Switzerland) knowledge of the field of study to provide business with decision • Master in Data Science (EPFL, École Polytechnique Fédérale de support models: Lausanne) - Transform business issues into mathematical problems Versatile - Apply statistical models to explain a given problem Levels & Types of training and agile - Provide businesses with clear presentations of the analysis • A Level +5 years of study in Computer Science and Advanced carried out on their issues, highlighting possible leads Mathematics, Econometrics for development. • Specialised engineering school Come over here, Oh, I know who she is! She is very She loves I’ll introduce you to “Brio”. organised. methodologies and statistics ???

It’s Justine, she is my Hmm… And do you have She is like a demanding … Who makes sense of the data! Oh, really? neighbour and lives across any idea about what’s interpreter... the street! her job? So who is she?

0111000110 01100111001010 10011001

14 15

okayyy… Let’s move Of course! • on... She is:

... And assesses their She reliability to optimise business locates relevant decision-making. information... The interpreter The Chief Analytics Officer exploit IT and technical tools and uses statistical methods (including Data Science) who makes sense to help organise, synthesise and translate data efficiently. He identifies, among all the information available to the company, which is the most important / relevant to extract for optimal decision-making, based on an objective of the data methodology based on statistics. Where appropriate, he ensures that the information collected internally or organisation externally is reliable, consistent and ready for analysis. He can also pilot the industrialisation of the process for the most interesting data. He organises, synthesises and translates information to facilitate decision-making. and method Training levels • A Level +5 years of study in Computer Science, Statistics • Studies in Data Science or Econometrics • 10 years of experience at least

Very organised He popularises To make it Your cousin the results So here I imagine... understandable and usable is... by all.

No, he is the translator … and data on Ah, you already • Not really, between business issues, on the the other. OK, so here His thing is graphs, know?! no… one hand... is our... right? I tried a note of humour ...

16 17

… We call it Graphics in But he is also called “Data Visualisation”! Data... the data analyst.

He’s the man for the job to guide strategic He defines decision-making. the key performance The translator indicators. The Data Consultant / Data Analyst generally works on a specific type of data from a single and known source, between business and which he analyses with a «business» perspective in order to guide strategic decision-making. Working with Data Scientists data issues and business experts, he defines key performance indicators in particular (KPIs) to popularise and give his results back to decision makers through an exploitable format. He uses the various data tools at his disposal in order to explore, organise, synthesise and translate raw data such as, for example, consumption trends or a significant change in buyer profiles. He can be more broadly responsible for telling an organisation what it can expect from its data (including outside its most Key common areas) and providing an operational response. performance indicators Levels & types of training • A Level +5 years of study, Big Data and / or Mathematics and / or Statistics and / or Business Intelligence • Specialised engineering school

data analyst … I present Speaking of Data Visualisation, He is more actually… to you the… actually no ! in the form than in not exactly… programming ? yes. That’s it.

He is also a He creates web or mobile developer! Classy! applications

18 19

I feel you like this one… You He is an artist, the painter of So that we can always Yes! I am! would have liked to have done the canvas, the storyteller! see more clearly, Shh. this job, right?! ...

To open up new avenues of analysis!

He contextualises company data as simply as As his name suggests, possible. He gives them meaning and illuminates The artist of he is an expert in «Dataviz» the impact of their use. data, the painter The Data Visualisation Consultant is the company’s storyteller. He is able to exploit business data, contextualise it and tools! of the canvas who provide simple visualisations to explore its meaning and impacts. Thanks to his judicious choice of spatial organisation, links between data, colours, shapes, the Data Visualisation expert stages complex data, makes them intelligible and accessible in As they say! excels in the order to present them to actors without technical expertise. This profile has two facets: he is an export of a data visualisation tools doing reporting and storytelling on data, or he is a developer who creates data visualisation applications, whether in the In a nutshell: description of his work... intranet, on the web, on mobile applications or on paper. I glorify! Through his work on interfaces, this expert also enables operational teams to see more clearly in reliable data by asking the right questions, and to identify new avenues of analysis by exploring data in a new light. He must be able to choose the most relevant visualisations and likely to bring the least bias. Gives meaning Levels & Types of training • A Level +5 years of study in Mathematics and / or Statistics + Business Intelligence (data analysis and Data visualisation) • Specialised engineering school

An artist Actually, he makes sure that the He suggests This other data professional works … The data engineer we met systems already available are following the best regularly with... at the start… performing well! practices...

That way, So, he is … And using mistakes are the the best avoided! tools

20 21

And the productivity of the entire chain is He is the team’s We can see Damn, it’s improved! production manager. him as… Exactly! stuck...

As… Well, he automates As… quality The team’s production The DataOps Engineer orchestrates the production data analysis pipeline, promotes production functionality manager and automates quality, always in conjunction with the Data Engineer. He also ensures that systems already in production are available and performing. The DataOps Engineer also suggests best practices and best tools among data science teams to improve productivity and avoid common FYI, mistakes. this jacket was brand Performance Levels & types of training new!!! • A Level +5 years of study compulsory in IT / Big Data • Specialised engineering school

An evangelist Here I am again! He is up to date with Fortunately, I have a And this one, No… the latest regulatory spare jacket! do you know Hmm… developments... his job?

... To ensure that we comply And he follows IT security with the legislation on news... personal data.

Well, this is our :

22 23

You can think of it a bit like the data Lend me Commissioner of the proper application protector. your sword, I He measures of the rules, he also ensures that, with us, really like it too the risks! the data is well guarded and secure. much!

Him! I see!

He defines the roles and He ensures responsibilities of each. Since May 2018 and the application of the General Data Protection Regulation (GDPR), this position is man- compliance with The commissioner datory in Europe in companies and administrations that process sensitive or large-scale data. for the proper The mission of The Data Protection Officer (DPO) is to inform about, advise about and control of data the regulations. governance (particularly personal data). Its challenge is to keep abreast of all the company’s data-related application of projects, in order to be able to make recommendations sufficiently upstream in privacy by design initiatives. regulations around It is a profession at the crossroads of law, IT security, compliance and ethics. personal data The DPO is responsible for ensuring compliance with the regulations, defining the roles and responsibi- lities of each, establishing a mapping of processing and data flows, keeping the processing register and (in particular GDPR overseeing the management of security incidents (including those with subcontractors). in Europe) Training levels DPOs are often hybrid profiles, who can measure risks, Three types of employees are potentially concerned: Regulatory manage IT projects and integrate the notion of “Privacy developments by design”. • Company lawyers and more generally all functions linked to the General Secretariat • Data project managers Control • Internal auditors mission And this is the last Disseminated for today! There is a lot It is varied, Here is a new professional of data, throughout the to discover. Last but company. not least...

And guarantee easy Her mission is to steer the choice and secure access to of platforms and ecosystems information! She is the: to access data...

24 25

Where is she She’s the data A real bridge between from? manager!!! the various departments In few words: What is she of the company and “make ultra-complicated doing? the IT department, things simple”!

Her mission She is so annoying. She finishes Make The Chief Data Officer, or CDO, creates an environment that allows different managers in the business to easily - and securely - access is to my sentences thinking she knows the information they need for optimal strategic decision-making. He must find the most appropriate platforms, Data & Business Intelligence ultra-complicated what I’m going to say... The conductor software systems, and ecosystems (dataset, etc.) so that everyone can perform analyses independently. The CDO is therefore at the heart of things simple! its organisation. The CDO is also responsible for the quality and consistency of the data. His function therefore intersects with those of other of a data-centric professions such as the management controller, the IT director (CIO) or the head of operational activities. He works in close collaboration transformation with all the data specialists within his company. Training levels (Big) Data training • A Level +5 years of study in IT, Management, Statistics Specific training now exists in (Big) Data. They are still few in number. Platforms and / or Marketing Here are some examples of existing training: and • Big Data training • MSc (master of sciences): Statistics for Smart Data (Ensai) ecosystems • 10 years of experience at least • Big Data for Business (École polytechnique - HEC) • Data Sciences & Business Analytics (Centrale Spelec - Essec Business School) • Applied Data Science & Big Data (Data Science Institute) • Data Science (Ensae ParisTech) Data • Data management ( School of Business) Director Oh dear… Yes and no, So there you have DATA BRUTES not really! COLLECTEES it. Did you like it? EN INTERNE DATA BRUTES COLLECTEES Yes! EN EXTERNE IA / MACHINE So, did we just Really?! LEARNING discovered all data CHIEF ANALYTICS professions? OFFICER DATA SCIENTIST DATA DATA ENGINEER ARCHITECT

CHIEF DATA DATA DATA OFFICER PROTECTION CONSULTANT OFFICER DATA DATA VISUALISATION STEWARD CONSULTANT

These professions are developing Let me tell you a secret! at high speed! For example, we did New professions will appear not mention the Data Owner or CLIENT in the years to come! the Data Manager. DATAOPS ENGINEER DATA LAKE / BIG DATA

Awesome!

26 YEAR 2020

Discovering the consulting professions : www.concepteursdavenirs.fr To find out more about the main data professions : www.syntec-conseil.fr

Character creation: Adrien Liard Design and production: Six Content production and writing: Members of the Syntec Conseil Data commission Dialogue writing Agence Elo A Comic ordered and financed by Atlas, OPCO of financial services and consulting, following defined cooperation lines in the agreement signed with the Ministry of

Education and Youth, the Ministry of Higher Education, Research and Innovation, - 04 2021 www.six.fr with the help of funds collected under the apprenticeship tax.