<<

A Exploration of the X-ray Transient Sky using the EXTraS Science Gateway

Daniele D’Agostinoa,∗, Duncan Law-Greenb, Mike Watsonb, Giovanni Novarac,d, Andrea Tiengoc,d,e, Stefano Sandrellid, Andrea Belfiored, Ruben Salvaterrad, Andrea De Lucad,e

aNational Research Council of Italy - CNR-IMATI, Genoa, Italy bDept.of Physics & Astronomy University of Leicester, U.K. cScuola Universitaria Superiore IUSS Pavia, Italy dNational Institute for Astrophysics INAF, Milan, Italy eIstituto Nazionale di Fisica Nucleare - INFN, Italy

Abstract Modern soft X-ray observatories can yield unique insights into time domain astrophysics, and a huge amount of information is stored - and largely un- exploited - in data archives. Like a treasure-hunt, the EXTraS project har- vested the hitherto unexplored temporal domain information buried in the serendipitous data collected by the European Photon Imaging Camera in- strument onboard the XMM-Newton satellite in 20 years of observations. The result is a vast catalog, describing the temporal behaviour of hundreds of thousands of X-ray sources. But the catalogue is just a starting point because it has to be, in its turn, further analysed. During the project an education activity has been defined and run in several workshops for high school students in Italy, Germany and UK. The final goal is to engage the students, and in perspective citizen scientists, to go through the whole vali- dation process: they look into the data and try to discover new sources, or to characterize already known sources. This paper describes how the EXTraS science gateway is used to accomplish these tasks and highlights the first discovery, a flaring X-ray source in the globular cluster NGC 6540. arXiv:1911.06559v1 [astro-ph.IM] 15 Nov 2019 Keywords: Science Gateways; Astrophysics; Citizen science; X-ray astronomy; Virtual Observatory

∗Corresponding author Email address: [email protected] (Daniele D’Agostino)

Preprint submitted to Future Generation Computer Systems November 18, 2019 1. Introduction X-ray astronomy probes a wide diversity of phenomena, related to the most extreme physical conditions that can be observed in the Universe: very strong gravitational and/or electromagnetic fields, very high temperatures, populations of particles moving close to the speed of light [1]. Almost all cosmic X-ray sources, from flaring stars in the solar neighbourhood, to ac- creting supermassive black holes in galactic nuclei at cosmological distances, are variable as a function of time and characterized by flux and spectral changes on distinct time scales, ranging from a fraction of a second to sev- eral years. Indeed, variability studies are crucial to understand the nature and physics of the sources. Every day observing facilities with time-resolved imaging capabilities [2] collect huge amounts of potentially interesting information about serendip- itous1 X-ray sources and their temporal variability, which remains mostly unused, stored in data archives. In the soft X-ray range (∼ 0.1 − 12 keV), the European Photon Imaging Camera (EPIC) instrument onboard the Euro- pean Space Agency mission XMM-Newton is the most powerful tool to study the variability of faint X-ray sources, thanks to its unprecedented combina- tion of large effective area, good angular, spectral and temporal resolution, and large field of view [3, 4]. Twenty years after its launch, EPIC is still fully operational and its immensely rich archive of data, the XMM-Newton Science Archive2 (XSA), keeps growing. Major efforts are ongoing to explore the serendipitous content in XMM data. For instance, a catalogue including all sources detected in EPIC obser- vations is being regularly updated, as more data become available, and its most recent release3 is the richest catalogue of X-ray sources ever produced. However, temporal domain information is not systematically extracted. The first, complete study of time variability in EPIC sources was carried out by the European-funded “Exploring the X-ray Transient and variable Sky” project (EXTraS, http://www.extras-fp7.eu) [5], which was run from 2014

1lying by chance in the instrument field of view, in the portion of the sky surrounding the target of the observation 2http://nxsa.esac.esa.int/nxsa-web 3https://heasarc.gsfc.nasa.gov/W3Browse/xmm-newton/xmmssc.html

2 to 2016 by several scientific institutions led by INAF, the Italian National Institute for Astrophysics: CNR-IMATI and IUSS Pavia (Italy) MPE and Friedrich-Alexander-Universit¨atErlangen-N¨urnberg (Germany) and Univer- sity of Leicester (UK). Its main goal was the realisation of a vast catalogue to be released to the astronomical community, describing and quantifying in a set of synthetic parameters the temporal behaviour of all EPIC sources. To accomplish this goal, new data analysis algorithms were designed and im- plemented. The results and software are available through a science gateway (http://portal.extras-fp7.eu) [6] made up of an open access database, named the EXTraS Archive, and a portal for running the software on new data or to re-analyse an observation with different parameters. The software is of particular importance for enhancing the discovery potential of the XMM- Newton mission [7]. This is especially true as we look to the future: the EPIC instrument is still fully operational and collects new data daily, and its operations may be extended for more than one decade. As the most sensitive search for variability ever performed, EXTraS is raising new questions in high-energy astrophysics [8] and may serve as a pathfinder for future missions. The output catalogue, together with our analysis tools, discloses a huge amount of information, relevant for studies of almost all classes of astrophysical sources. The present effort is focused on delving into this vast dataset and finding interesting phenomena, as [9, 10]. Since the beginning of the project, project partners believed this would be an exciting challenge for citizens interested in astrophysics and student internships. Therefore an educational activity was defined for offering high school students the chance to go through the whole validation process: they study the data and try to discover new sources, or better to characterise already known sources [11]. Apart from good basic science they can learn and use statistics, physics, image handling, the added value is that they could be the very first people in the world to discover and characterize those transient and variable sources: they managed to get their faces on the XMM- Newton cameras for a handful of seconds, just like extras in movies. Since October 2015 several workshops and related educational activities have taken place. This paper describes how the EXTraS science gateway has been used together with the first important discovery, an enigmatic flaring source in the globular cluster NGC 6540 [12]. The paper is organized as follows. Section 2 discusses work related to citizen astronomy, Section 3 presents the EXTraS portal while 4 presents the EXtraS archive. Section 5 describes the activities carried out for engaging

3 students, while Section 6 presents the conclusions.

2. Citizen Astronomy and its Implementation As reported by the Encyclopaedia Britannica4, “The spectrum of projects and initiatives that fall under the umbrella of citizen science is broad, and consequently there is debate surrounding a formal definition”. One possible definition is “A citizen scientist is a volunteer who collects and/or processes data as part of a scientific inquiry. Projects that involve cit- izen scientists are burgeoning, particularly in ecology and the environmental sciences, although the roots of citizen science go back to the very beginnings of modern science itself” [13]. The most famous - if not the first - initiative in this field is SETI@Home [14], which exploits the idle computing time of common citizens in the search for extraterrestrial life since 1999 [15]. Presently, there are several Internet- based citizen science projects and associations devoted to promote such col- laboration paradigms, such as the Citizen Science Alliance5, the OpenScien- tist blog6 and the American Association for the Advancement of Science7. Furthermore publications using data and results collected with the contri- bution of citizen scientists are becoming common in many disciplines, as [16, 17, 18]. Although a valuable motivation for a citizen scientist in joining a science project is the satisfaction of his/her intellectual curiosity, thus contributing to scientific dissemination [19], both the previous definitions as well as the cited activities and publications mainly stress the active role that citizens fulfill when participating in a project [20]. They can: • provide data, as for example in the Great Sunflower Project8 or Crowd- Hydrology [21];

• provide computational resources to elaborate data, for example with the BOINC9 platform;

4http://www.britannica.com/EBchecked/topic/1993687/Citizen-Science-A-Platform- for-Nonprofessionals-Year-In-Review-2014 5http://www.citizensciencealliance.org 6http://www.openscientist.org 7http://www.aaas.org/ 8http://www.greatsunflower.org/ 9http://boinc.berkeley.edu/

4 • participate in the data processing and analysis. The last two items represent the initiatives for involving citizen scientists in astronomy10 [22]. In particular there are several such initiatives, as can be seen in the Spacehack.org website, on the platform or - as regards providing computational resources - among the BOINC projects. Focusing on the data analysis, one of the most important platforms is represented by Zooniverse [23, 24], “a platform for people-powered research where anyone can study authentic objects gathered by researchers, like im- ages of faraway galaxies, by answering simple questions about them”11. This platform hosts several projects, whose creation is straightforward, just re- quiring the upload of the data and a choice of the tasks the volunteers must do, such as answering questions or marking features in the data by drawing or tagging text12. The platform today hosts about 21 live projects related to astronomy [25, 26], resulting in an increasing number of scientific discoveries, e.g. [27, 28, 29]. While Zooniverse is a collaboration between institutions from the U.K. and the U.S. supporting the creation of science gateways with the specific object of offering citizen science tools, CitizenScience.gov is an official U.S. government initiative aiming at supporting federal agencies/institutions in accelerating the use of crowdsourcing and citizen science. This means the focus is on providing a toolkit for improving scientific projects software, i.e. to improve tools originally developed for professionals also to citizens, and on disseminating the advantages of this approach by providing an extensive list13 with more than 430 successful active or past projects. Classification and annotation of data represent the most common tasks, but more complex analyses can be performed. For example the SkyServer initiative hosts several projects for supporting the study of stars and galaxies in the Sloan Digital Sky Survey (SDSS) [30], i.e. the same objects that pro- fessional astronomers study. The focus here is twofold: to support the study and the use of heterogeneous data, e.g. images, spectra, photometric and spectroscopic data, and tools, e.g. for creating astronomical finding charts

10https://www.iau.org/public/themes/citizen-science-projects/ 11https://www.zooniverse.org/about 12https://help.zooniverse.org/getting-started/ 13https://www.citizenscience.gov/catalog

5 for an object seen by the SDSS or navigating the sky14. Also Zooniverse is developing an environment where more advanced data analysis tools can be developed and provided to citizen scientists15. A key factor for the success of such advanced initiatives, as outlined in [22], is represented by “providing user-friendly, web-based tools enabling fairly sophisticated data analysis to be performed by anyone with a browser. Open source tool code is a minimal requirement in this model; finding ways beyond this to support citizen algorithm development seems to be likely to pay off”. One of the most important examples in this direction, that can be represented as an evolution of SkyServer [31], is the SciServer collabo- rative environment for large-scale data-driven science [32]. It uses Jupyter notebooks [33] for advanced analysis in a Cloud environment. Jupyter is an interactive computing environment based on the concept of notebooks, i.e. documents including text, plots, equations, videos but also live code and interactive widgets written in Python, R or other languages. In particular the output generated by running code is embedded in the notebook, which makes it a complete and self-contained record of a computation that can be executed locally or on the same server as the source data. The result is an environment able to support highly complex searches and analyses resulting in millions of sky objects from several databases16. Moreover Jupyter notebooks can be used also for creating science gate- ways [34]. In general, a science gateway can be defined as a set of software, data collections, instrumentation and computational capabilities that are integrated - using different technologies and middleware [35] - via a Web portal (or a desktop application) in a user friendly and effective environ- ment supporting the scientific research and education activities of a specific community. Science gateways are gaining an increasing interest in many com- munities [36, 37], such as the astrophysics one [38, 39, 40], also because one of the best strategies to provide software and data to a scientific community is through a set of services designed following this paradigm. The portal we developed in the project and used for involving citizen scientists [6] is based on this paradigm. In conclusion most of the presented citizen science projects represent sci-

14http://skyserver.sdss.org/dr7/en/tools/started/ 15https://blog.zooniverse.org/2013/11/26/zootools-going-deeper-with-zooniverse- project-data/ and https://github.com/zooniverse 16http://www.sciserver.org/integration/astronomy/

6 ence gateways accordingly to the definition we reported. However, these gateways normally offer to citizens simplified interfaces with respect to those offered to scientists, in order to balance the efforts in their learning curve with the need to provide them enough flexibility to actually carry out effective analyses17. But, at the same time, projects like SciServer allow college stu- dents and and science-literate citizens to access the same tools as researchers who work with Big Data18.

3. The EXTraS Portal

Figure 1: The architecture of the EXTraS science gateway

The EXTraS science gateway is composed of an archive, presented in the following Section, and a portal. The architecture is depicted in Figure 1 and described in details here and in the following Section. The EXTraS portal has the goal of providing users with a seamless envi- ronment to process the observations made available from the XSA with the

17https://daily.zooniverse.org/2017/09/11/keeping-it-simple/ 18http://www.sciserver.org/about/vision/

7 EXTraS pipelines, hereafter called experiments. It provides an user-friendly interface, with no need for the installation of any software.

Experiment configuration. The Jobs Management User interface represents the home page of the EXTraS portal. It provides users with the possibil- ity of creating, submitting and managing the different analysis experiments based on the software developed within the EXTraS project. In particular it presents all the submitted or configured analyses, providing the possibility to create a new analysis starting from an existing configuration or share results with other users. This module is based on AngularJS and it is a complete web app, without any server side code [41]. It uses the Persistence API to store and retrieve experimental data, and it activates the other portal modules corresponding to the different operations available.

Experiment execution. All jobs are managed by the portal, based on comput- ing resources provided by EGI Fedcloud [42] to virtual organizations (VO), i.e.groups of users where members are usually in related research activities. The Workflow Configuration module is responsible for interacting with the user for the creation and configuration of experiments based on the EXTraS software. In the portal every analysis corresponds to a single application, therefore there is no need to explicitly create and manage workflows. For this reason a user interacts directly with an analysis-specific interface (UI), as shown in Figure 2 for the Transient Analysis. Its main aim is to collect the parameters value and to create a namelist, that will be provided to the FedCloud Submission Handler for the actual execution of the job. The FedCloud Submission Handler module manages submission of jobs and provides a full view of the job status, results and logs. In particular the actual submission requires the user specifies as input for the analysis one or more observation identifiers (OBSID) among those contained in the XSA. Each of them corresponds to a job, therefore an analysis configuration can result in multiple jobs executed at a time. During the execution the user can monitor the status of the job by means of the real-time log information the software tool provides. Since its official release at the beginning of 2017, 54 users registered to the portal and submitted experiments for a total of 32,240 core-hour in 2017, 505,380 in 2018 and 10,040 in 2019, as reported by the EGI Accounting

8 portal19.

Results management. When a job terminates, the FedCloud Submission Han- dler provides the possibility to retrieve results and also log information. All the information related to a job (e.g. the configuration parameters, the logs, the results, the ownership/sharing information and possible comments) are stored on the portal database via the Persistence API until it is deleted by the user who owns it. The EXTraS portal provides two further key features: the ability to share an analysis (i.e. the namelist and possibly the results) and support for the interaction and discussion (in terms of comments) among the users sharing it. Sharing a completed job means not only that the experiment results are visible to other users, but also that the configuration is shared and can be used as a starting point for re-submitting the experiment on a new set of data. Thus, a job execution can be replicated by other users that can, for example, validate the experiment results or explore the behaviour by changing one or a few parameter values. Any result computed within the portal is not automatically transferred in the archive, but it has to be validated by the project community, that can use the portal to publicly discuss it.

Non-academic users. Citizen scientists can interact with the portal exactly as the scientists can. In particular they can exploit the EGI Fedcloud in- frastructure for submitting jobs. This has been made possible by the use of “robot certificates” [43]. EGI Fedcloud in fact relies on a single sign-on mechanism to access the federated services based on X.509 certificates and VO membership. Robot certificates were introduced to allow users, who cannot get or are not famil- iar with personal digital certificates, to exploit any distributed infrastructure relying on them in their research activities. The robot certificate is usually associated with a specific application (or function) that the application de- veloper/provider wants to share with all the VO [44]. This is exactly the scenario which has arisen in the EXTraS activities, because portal users are provided with the possibility to run only pre-defined software tools. There- fore the FedCloud Submission Handler module interacts with an e-Token

19 https://accounting.egi.eu/vo admin/cloud/

9 server [45] to submit jobs on the resources granted to the “extras-fp7.eu” VO [46]. Furthermore, the support for the interaction and discussion is very im- portant also for the activity of citizen scientists, that can be assigned a set of XSA observations and parameters to inspect. In this case they can share the results and a preliminary analysis among them and with the scientists, for a fast but effective screening.

Figure 2: The Transient Analysis UI shown by the Workflow configuration module of the EXTraS portal.

4. The EXTraS Data Archive The EXTraS Data Archive forms the primary online repository for the data generated during the EXTraS project. This builds on the pointed and slew XMM-Newton Serendipitous Source Catalogues [47, 48] and supports a wide range of newly-generated X-ray light curves, associated source cata- logues with measures of variability and statistical significance, and derived products such as hardness ratios, power spectra, source classifications and source summary files.

10 The design of the archive is derived from the existing high-energy astro- physics data archive Leicester Database and Archive Service (LEDAS20) at the University of Leicester, which hosts data from several major X-ray mis- sions, including a full set of XMM-Newton EPIC serendipitous data (around 200 million seconds of observing time). The core archive system originally developed for LEDAS has been completely rewritten to current software de- velopment standards (including Model-View-Control architecture and unit testing). Moreover a major technological development in archive provision post-2000 has been the rise of e-Science, and particularly in the field of as- tronomy, the Virtual Observatory (VObs) concept [49, 50]. The goal of the VObs is to provide a standardised set of metadata, access protocols and sup- porting software tools to improve access to, and interoperability between, very large astronomical datasets. The data. EXTraS draws on the 3XMM-DR4 public data to search for pe- riodic, aperiodic and transient sources with much greater fidelity than the basic light curves and Fourier Transforms generated as part of the 3XMM automated processing. This variability is being studied down to the native time resolution of the instruments (typically 73 ms and 2.6 s for the EPIC pn and MOS cameras respectively). Long-term variability is being characterised for all sources with multi-epoch data. Upper limits will be computed in the case of non-detections in specific observations. Catalogues and bulk products produced during the EXTraS project have been transferred to the University of Leicester, UK (UoL) to be incorporated in LEDAS. Details are provided in [51]. A total of 18 TB of LEDAS/EXTraS data is presently held on UoL central archival storage. In order for VObs applications to handle EXTraS time series datasets fully and consistently, the data structures and metadata must be documented according to a standard syntax, as defined by the International Virtual Ob- servatory Alliance (IVOA) [52]. But the existing data models are insufficient to describe EXTraS data products. This is the reason why during the project we consider that the development of a specific data model should follow two basic principles: simplicity, which eases management and adoption, and ad- herence to existing IVOA standards, which increases the probability of wider adoption. To that end, we propose to base the new data model, named ExtDM, on the existing PhotDM standard, with extension to the time do-

20 https://www.ledas.ac.uk

11 main drawn from the lower-level axis definition of the Characterisation DM (CharDM) [51]. We joined the standards group for time series data within the IVOA with the aim of advancing the interests of the EXTraS project, and incorporat- ing any approved standards into the EXTraS infrastructure at the earliest opportunity. The process is ongoing at the time of writing.

The search form. The main page of the archive provides a top-level menu to access EXTraS data products subdivided by the kind of analysis they are re- lated to, i.e. Aperiodic Variability, Search for Periodicity, Transient & Highly Variable Sources, Long-Term Variability and Multiwavelength classification. A combined catalogue, which searches all EXTraS source catalogues, and a basic catalogue cross-matching facility are also provided. Direct catalogue downloads in FITS [53] and VOTable [54] formats are available. It also pro- vides an option to access all catalogues currently held by the LEDAS system (over 900 catalogues at the time of writing) covering a wide variety of mis- sions and wavebands. Online help is available for all catalogues. The catalogue search form, shown in Figure 3, allows searches on a given sky position, using either a cone (fixed radius from search centre), box or rectangle search area. The form makes also possible to type the source name and click on “Resolve Name”. In this case the Simbad database [55] is queried to find the source coordinates for the object. If successful, these are displayed in the Cone Search input box on the query form. A minimal or full set of table columns can be selected, and a variety of output formats displayed. Generation of archive search results in multiple formats (HTML, ASCII table, CSV, VOTable etc) is accomplished using the Twig templating lan- guage for PHP21. Twig is a full-featured presentational language for data output including generation of web pages – it handles branch and loop con- structs and supports multiple inheritance, enabling complex page layouts to be built from a common set of simple blocks. Tabular search results are presented using the Javascript DataTables library22, which produces tables which can be filtered, sorted and paginated on the fly by the end user. An example is shown in Figure 4 for the full list of results, while the bulk prod- ucts associated within each given dataset ID organised by category is shown in Figure 5.

21http://twig.sensiolabs.org/ 22http://www.datatables.net/

12 Figure 3: The advanced catalogue search form in the EXTraS archive

Visualization. Beside the download of the results data, an expanding set of interactive visualisations for EXTraS catalogue and bulk product data is provided. The visualisations are generated directly in the browser (i.e. by clicking on the blue “PREVIEW” button at left links in Figure 5) and require no additional software installation. The time-series analysis tasks generate science output typically in FITS format, which cannot natively be displayed in a web browser. A static “quick look” graphical view for e.g. spectra and time series is already available for sources present in the 3XMM-DR5 catalogue in PNG and PDF formats. For new sources the archive data ingest script is able to generate such static images automatically using e.g. Python matplotlib.

13 Figure 4: The full list of results as an HTML page. The buttons on the left allow one to explore all the products and the links to directly download selected files

Moreover the archive interface has been enriched with dynamic, interac- tive visualisations using the Python graphics library Bokeh23 running on the Python WSGI application server gunicorn24. Requests are sent to the visual- isation server using a REST interface, and the server responds with a JSON data block which is rendered by the view using standard AJAX techniques. Examples of EXT-PDA visualisation output are shown in Figures 6 and 7. The use of JSON for data transfer enables additional features to be added to EXTraS archive data analysis in the future using standard Javascript libraries, such as jStat25.

5. The Education Activities During the project a workshop scheme has been designed to let high school students play as researchers, i.e. they are introduced to the scientific method and they learn how to work with the EXTraS software tools and results. The formal goal of the students is the validation and classification of astronomical sources with a peculiar temporal behaviour (pre-selected by researchers), by exploring the data extracted from the EXTraS data archive.

23http://bokeh.pydata.org/ 24http://gunicorn.org/ 25http://www.jstat.org

14 Figure 5: The summary page of the catalogue results for each single source

The workshop has been designed as a live role playing-like activity with a strong hands-on strategy and some formal education: the students are divided into cooperating groups that attend laboratories and lessons given by researchers. The groups gather information about high energy astrophysics, space technologies, data mining and analysis. Their ability to interact with experts is fundamental to reach their goals. A typical workshop lasts about 1-2 weeks (40-80 hours). We explicitly adopt an inquiry-based learning strategy, together with a peer-to-peer edu- cation approach. This global strategy both fosters the critical thinking and

15 Figure 6: Bokeh visualisation of Aperiodic Variability MOS1 timeseries engages the students, who act as though they were researchers. A short but clear framework for describing inquiry in astronomy is provided by the IAU project astroEDU26 [56]. During the analysis of possibly interesting X-ray source candidates they are guided by five major questions: 1. Is the selected candidate a real, astrophysical X-ray source? 2. Is the peculiar temporal behaviour related to any instrumental effect? 3. Has the source a possible counterpart at other wavelengths (e.g. opti- cal)? 4. What kind of astrophysical object or phenomenon might it be? 5. Might it be an important scientific discovery? We run several workshops and other education activities laying on the workshop experience in Italy, Germany and United Kingdom, engaging more than 200 high school students, from 2015 onwards. Details are available on

26http://astroedu.iau.org/ebl/

16 Figure 7: Bokeh visualisation of Long-Term Variability timeseries the project website27. The first part of the workshop aims at engaging participants in contempo- rary astrophysics, with a specific regard to the high energy band. Researchers or teachers introduce them to the contents and the technical language of as- tronomy (such as field of view, electromagnetic spectrum, time resolution, spatial resolution and energy resolution, light curve, photons and so on) and maths (such as probability, confidence level and so on). At this stage, re- searchers and teachers should try to avoid a frontal approach. They are requested to solicit questions and discussion on the treated topics, instead of acting as pure experts. The goal of this part is to provide the students with the right method and conceptual tools to face the tasks of data reduction and interpretation, so that some kind of passive learning is assumed on the part of the students. The second part of the workshop aims at giving the students the oppor- tunity to analyse the results produced by the EXTraS software tools, either

27http://www.extras-fp7.eu/index.php/educational/list-of-workshops

17 stored in the archive, or generated within the workshop by using the por- tal. Different analysis tasks have been implemented, based on the use of different resources and tools, and related to a large diversity of astrophysical phenomena.

Using the archive. A sample of sources with peculiar temporal behaviour is pre-selected by researchers, based on a blind search in the results from the automatic characterization of variability performed within EXTraS. Such sources are proposed to the students for validation and classification. We considered the case of short-term variability, ranging from a few seconds to a few hours (i.e. on a time scale shorter than the duration of an XMM observation) – in particular, focusing on flaring/transient sources (displaying a sudden, large increase in flux, and a subsequent fading to the pre-burst flux level) and on eclipsing/dipping sources (displaying a sudden drop in flux, followed by a fast rise to the pre-eclipse level). For each source, students are asked to:

1. confirm that the source candidate corresponds to a real astronomical source by inspecting the EPIC sky images (is the source candidate close to a CCD edge or defect? Is it located close to a very bright source or superimposed to a bright diffuse source?); 2. confirm that the candidate flare or dip is real, by inspecting the light curves of the three EPIC cameras; 3. exclude the possibility that the flare/dip is related to imperfect back- ground subtraction, by inspecting the background light curves (is the feature simultaneous to a background flare?); 4. check whether the feature can be detected in multiple energy ranges, by inspecting energy-resolved light curves; 5. check whether the source is already known and classified, and whether its peculiar temporal behaviour had already been observed and pub- lished, by using the Simbad database; 6. if the source is not known/classified, search for possible counterparts in multi-wavelength catalogues and using the ESASky tool28; 7. propose a classification for the source and an interpretation for its tem- poral behaviour;

28http://sky.esa.int

18 8. try to determine whether this could be an interesting discovery.

All of the above steps can be completed by inspecting EXTraS products in the archive, or using online resources linked to the archive web pages (e.g. the 3XMM catalogue and its products, the Simbad database, results of cross-correlation with multi-wavelength catalogues). We decided to focus on the cases of flares and eclipses, because the simple shape of the temporal modulation eases several steps of the analysis. In prin- ciple, the same approach could be applied to any kind of temporal variability. For instance, an interesting possibility could be to study sources with large variability on a long time scale (ranging from few days to few years - e.g. between different XMM observations). Details on how the students use the archive interface are provided in Appendix A.

Using the portal. The EXTraS science gateway has also been designed to give non-expert users the possibility of extending the EXTraS analysis to new XMM observations or reproducing the EXTraS analysis of old observa- tions with different settings and input parameters. During the workshop, the students have also the opportunity to experience this additional step in the work of a researcher by selecting targets from recent XMM observations, not included in the EXTraS archive, and performing the full EXTraS analysis, starting from raw data and reaching down to the interpretation of results. The choice of the targets of the new observations is left to the groups of students, with only some general suggestions by the researchers. This resulted in some groups selecting particular sky regions (e.g., star clusters or nearby galaxies) and other choosing random positions (e.g., selecting the most recent or the longest observations). After the selection of the XMM observations (the observation ID is the main input data that students enter into the portal interface), the students could process the data (automatically downloaded from XSA) with the same Transient & High Variability software developed and used in the project, with the possibility to modify a limited set of parameters (e.g., the detection threshold), using the portal UI as described in details in [6]. In order to obtain the detection of a significant number of transient candidates to validate, the researchers suggested suitable values, but encouraged the students to repeat the analysis with different parameters. Since the full data processing of each observation takes from a few to several hours, each group could select only a handful of observations and

19 the data processing could be performed by submitting the jobs to Fedcloud resources and collecting the results on the following day. After the data processing the results are stored in a TAR (Unix Tape Archive format) file ready to be downloaded to the students’ laptops, independently of the archive interface. The results can then be analyzed by the students following a procedure similar to the one described before for the analysis of sources in the EXTraS archive. Given the limited number of observations that could be processed during the workshops, compared to the much larger amount of computing time used during the EXTraS project to populate the archive, only a few possibly real X-ray transients could be discovered by the students. This aspect is of particular importance. On the one side we showed the students a significant part of the scientific process leading to scientific discovery and the difficulty in obtaining scientifically relevant results. On the other side this limited test demonstrates how the use of the portal by a much broader community of citizen scientists, guided by expert astronomers, could substantially support the scientific community in performing such analysis. Despite the low probability of finding a new transient, analyzing the ob- servation 0781690201 a group of students discovered a new X-ray transient candidate. This is an observation of a region of the Galactic plane with a duration of ∼ 78 ks. The Galactic plane is characterized by the presence of numerous stars that can give rise to stellar flares, which are the most fre- quent transient events expected in the X-ray sky. Using the visualization software developed within the EXTraS project for the screening analysis29, the students identified the new X-ray transient by analyzing the light curve of the three EPIC instruments, selecting the 0.5-4.5 keV energy band. The peak in the light curve is clearly visible in Fig 8, with a duration of ∼ 800 seconds, in a period of low background. The students then entered the coordinates of the transient candidate in the ESASky30 online tool to search for counterparts in other energy bands, finding a correspondence with a clearly visible star in the Optical/IR bands.

The first result: the peculiar transient J1806-27. A very unusual phenomenon was discovered within a workshop held at INAF/IASF Milano in September 2017. Six high-school students participated in the research work: Lorenzo

29http://www.extras-fp7.eu/index.php/scientific-community/softwaretools 30http://sky.esa.int

20 Figure 8: The analysis of a transient candidate with the EXTraS screening visualization tool.

Apollonio, Bartolomeo Bottazzi-Baldi, Martino Giobbio, Razvan Patrolea, Elena Pecchini and Cinzia Torrente from Liceo Scientifico G. B. Grassi, Saronno, Italy. As an input for the students’ work, we blindly selected ∼ 200 sources in the EXTraS database – half of them being consistent with a transient behaviour, and half with dipping behaviour. The students were arranged in two groups, with the task of scanning the list of sources, following the steps described in the previous section. At the end of the workshop, the two groups presented their results – a small sample of sources they had selected as potentially interesting. The case of the candidate transient 3XMM J180608.9-274553 (J1806-27) drew our attention. The transient is detected with high significance by all EPIC cameras and lasts a few minutes only, as shown in Fig. 9. The source, which had never been studied before, lies within the core of the Galactic globular cluster NGC 6540. The students’ report triggered a more detailed analysis by a team of astronomers at INAF/IASF. The transient, with a duration of ∼ 300 s and a symmetric rise/decay time, turned out to be very peculiar, defying any

21 classification. Assuming the source to be in NGC 6540, its peak luminosity of ∼ 1034 erg s−1 is orders of magnitude lower than the one of thermonuclear X-ray bursts from neutron stars in low-mass X-ray binary systems; a flare from a coronally active star in the cluster can also be excluded because of the short duration, a factor ∼ 100 smaller than expected based on the known correlation between luminosity and duration of stellar flares. The possibility of a less luminous flare from a foreground star is also unlikely, because of the lack of an optical counterpart in archival images collected with the . The properties of this peculiar transient have been published in full detail in [57]. The discovery and the active role played by students were the subject of an article on the web site of the European Space Agency31, and were also included in the weekly “Research highlights” section by the Nature magazine32.

Figure 9: The brightness of J180608.9274553 changed by up to 50 times its normal level in 2005, and quickly fell again after about five minutes.

6. Conclusions and Future Work The EXTraS project, being related to rare, exotic phenomena and to extreme, poorly known astrophysical sources, has great potential for the

31https://goo.gl/rUigBv 32https://www.nature.com/articles/d41586-018-05959-4

22 popularisation of science in general and of astronomy in particular. The researchers involved in the project deem outreach and involvement with schools and other institutions a fundamental component of their scien- tific mandate. Astronomy has a fundamental role in enhancing interest in science in young people, laying the basis for a career in Science, Technology, Engineering and Mathematics (STEM). The outreach and involvement of people of all ages is very important as well, because it generates interest in scientific research and fosters the diffusion of scientific culture. Within this framework, the availability of a vast catalogue and of the appropriate software both of them online and freely usable - allows us to think of the EXTraS activity as a Citizen Science candidate project, with a particularly interesting activity for the classroom. This paper presented the EXtraS science gateway and the didactic ac- tivity schema for high school that could foresee: the extraction of transient candidates through the algorithm implemented by the EXTraS team; the validation procedure; the search for a counterpart; a final discussion of the results. The success of the workshops held in these years shows that a citizen science approach can be very useful at school, even if the proposed activity is far outside the students’ curricula, provided that we offer them a stimu- lating challenge and framework. In addition, we showed that students can be engaged in a high level challenge. Indeed, the use of a Science Gateway as a citizen science platform is part of the challenge itself. At first sight, it can seem to be a complex tool to be used by students and teachers, such as an actual scientific laboratory. However, as soon as they become familiar with it, it rapidly turns into a chance to explore a whole scientific world: they can create, submit, manage different analysis experiments with their own choice of configuration parameters; they can retrieve and discuss the results and, moreover, they share parameters, data and results with other users (scientists or students), as we describe. This fosters the formation of a sort of enlarged and motivated user community, with a very strong effect of engagement of the non academic world which is one of the current assets of science communication and life-long learning. Our next step is to launch the activity as a totally citizen science one by improving existing didactic material for an on-line e-learning platform: we believe every interested citizen could be able to exploit the science gateway to identify and cooperate with us to analyze possibly interesting X-ray source candidates.

23 Acknowledgments This research has made use of data produced by the EXTraS project, funded by the European Union’s Seventh Framework Programme under grant agreement no 607452. This work used the EGI infrastructure with the ded- icated support of CYFRONET-CLOUD and INFN-CATANIA-STACK. Au- thors would like to thanks the EGI staff, in particular Giuseppe La Rocca and Diego Scardaci, for their valuable support. J180627 was selected as a potentially interesting source by L. Apollonio, B. Bottazzi-Baldi, M. Giob- bio, R. F. Patrolea, E. Pecchini and C. A. Torrente (Liceo Scientifico G. B. Grassi, Saronno) during their stage at INAF-IASF Milano in 2017, Septem- ber, within the Alternanza Scuola-Lavoro initiative of the Italian Ministry of Education, University and Research.

Appendix A. Study of candidate flaring sources Each group is assigned a list of pre-selected candidate flaring sources – an ASCII file with one row per source, including the source sky coordinates and a small set of other source identifiers (e.g. observation ID, exposure ID, unique source ID). For each source in the list, each group should complete the following steps, guided by the 5 major questions listed in Sect. 5.

1. Go to the EXTraS Archive main data access page, select the “Ape- riodic, short-term variability” query form and search the source by coordinates. 2. in the Results page, a list of all the XMM exposures (for each EPIC camera) analyzed by EXTraS is displayed. Select the appropriate ob- servation and exposure ID for the pn camera and click on the Products button. 3. a list of all bulk products generated by EXTraS for the pn camera in the selected exposure is displayed. Follow the link to the 3XMM summary page. Inspect images: is the source in a noisy/confused region? Is it close to a detector edge?

Step 3 allows one to answer the major question: “Is the selected candidate a real, astrophysical X-ray source?”

5. Go back to the list of bulk products; select the Bayesian block light curve (full energy range) using the search box (e.g. by searching for

24 the “bblc” string within the names of the products) and visualize it by clicking on the Preview button. Answer the question: is a flare visible? 6. Go back to the list of bulk products; select and visualize the uniform bin light curve (500 s bin, full energy range). Is a flare visible? Identify the peak and read its time, count rate and the corresponding uncertainty by clicking on the selected data point in the plot. 7. Go back to the list of bulk products; select and visualize the background light curve. Does it show any correlation between behaviour of the source and that of the background? Is there any background flare temporally coincident with the source candidate flare? 8. Go back to the list of bulk products; select and visualize the uniform bin light curve (500 s) in the super-soft, soft and hard energy ranges. Is the flare visible in all of the energy ranges? 9. Go back to Results page, select for same observation the list of bulk products corresponding to the MOS1 camera. 10. Repeat steps 4-5 for the MOS1 camera. Is the flare visible in the Bayesian blocks light curve? In the uniform bin light curve? Is it the same feature seen with the pn camera? (simultaneous, with a similar shape of the light curve and roughly half the count rate measured by the pn?) 11. Go back to Results page, select for same observation the list of bulk products corresponding to the MOS2 camera. 12. Repeat steps 4-5 for the MOS2 camera. Is the flare visible in the Bayesian blocks light curve? In the uniform bin light curve? Is it the same feature seen with the pn camera? (same checks as for MOS1) The above steps 4-11 allow one to answer the major question “Is the peculiar temporal behaviour related to some instrumental effect?” 13. From the bulk products page of any camera, go again to the 3XMM summary page and follow the link to SIMBAD results. 14. A source list is displayed, ranked by (increasing) distance from the position of the 3XMM source. Select entries within 10 arcsec from the 3XMM position by clicking on their identifier. 15. For each selected source, read the classification and take note of all the relevant information (e.g. distance, multi-wavelength counterparts, association to a Globular cluster or to a remnant...). Fol- low the References link, identify any published paper that could in- clude analysis of the XMM dataset, scan it to assess if the flare is

25 detected/described; also take note of any X-ray observation performed with other instruments. 16. Start the ESASky app in the browser (not yet integrated with the EXTraS archive) and search the source by coordinates. A portion of the sky surrounding the selected position is visualized. It is possible to display the sky as seen at different wavelengths by different instruments. 17. Assess the presence of any counterpart at different wavelengths (near infrared, optical, gamma-ray).

The steps 12-16 allow one to answer the major question “Has the source a possible counterpart at other wavelengths (e.g. optical)?”.

18. Using all the collected information, try to answer the last two major questions: “What kind of astrophysical object or phenomenon might it be?” and “Might it be an important scientific discovery?”

References References [1] A. C. Fabian, K. A. Pounds, R. D. Blandford, Frontiers of X-Ray As- tronomy. Cambridge University Press, 2004.

[2] C.R. Kitchin, Astrophysical techniques, CRC Press, 2013.

[3] L. Strueder, U. Briel, K. Dennerl, R. Hartmann, E. Kendziorra, N. Meidinger, E. Pfeffermann, C. Reppin, B. Aschenbach, W. Bornemann, et al., The European Photon Imaging Camera on XMM-Newton: The pn-CCD camera, Astronomy & Astrophysics, vol. 365, pp. 18, 2001

[4] M. J. L. Turner, A. Abbey, M. Arnaud, M. Balasini, M. Barbera, E. Belsole, P. J. Bennie, J. P. Bernard, G. F. Bignami, M. Boer, et al., The European Photon Imaging Camera on XMM-Newton: The MOS cameras, Astronomy & Astrophysics, vol. 365, pp. 27, 2001

[5] A. De Luca, R. Salvaterra, A. Tiengo, D. D’Agostino, M.G. Watson, F. Haberl, J.Wilms, Science with the EXTraS Project: Exploring the X-Ray Transient and Variable Sky. The Universe of Digital Sky Sur- veys, Astrophysics and Space Science Proceedings, vol. 42, pp. 291-295, Springer, 2016.

26 [6] D’Agostino, D., Roverelli, L., Zereik, G., La Rocca, G., De Luca, A., Salvaterra, R., ... & Tiengo, A. (2019). A science gateway for Exploring the X-ray Transient and variable sky using EGI Federated Cloud. Future Generation Computer Systems, V. 94, p. 868-878.

[7] R. Mushotzky, What do Astronomers Really Want? Astronomical Data Analysis Software and Systems (ADASS XX) , ASP Conf. Proc., vol. 442, p.235-241, 2011.

[8] Israel, G.L., Belfiore, A., Stella, L., Esposito, P., Casella, P., De Luca, A., Marelli, M., Papitto, A., Perri, M., Puccetti, S., Rodrguez Castillo, G.A., Salvetti, D., Tiengo, A., Zampieri, L., D’Agostino, D., Greiner, J., Haberl, F., Novara, G., Salvaterra, R., Turolla, R., Watson, M., Wilms, J., Wolter, A. An accreting pulsar with extreme properties drives an ultraluminous x-ray source in NGC 5907, Science, 355 (6327), pp. 817- 819, 2017.

[9] Haberl, F., Israel, G. L., Castillo, G. R., Vasilopoulos, G., Delvaux, C., De Luca, A., ... & Tiengo, A. (2017). EXTraS discovery of two pulsators in the direction of the LMC: a Be/X-ray binary pulsar in the LMC and a candidate double-degenerate polar in the foreground. Astronomy & Astrophysics, 598, A69.

[10] Esposito, P., Israel, G. L., Belfiore, A., Novara, G., Sidoli, L., Rodriguez Castillo, G. A., ... & Read, A. M. (2015). EXTraS discovery of an 1.2- s X-ray pulsar in M 31. Monthly Notices of the Royal Astronomical Society: Letters, 457(1), L5-L9.

[11] Sandrelli, S., Belfiore, A., D’Agostino, D., ... & Tiengo, A. (2018). Hunt- ing for Black Holes: a Citizen Science Exploration of the X-ray Transient Sky. In Book of Proceedings Communicating Astronomy with the Public Conference 2018 2nd Edition, ISBN:978-4-908895-02-9, NAOJ publisher, Tokyo, Japan.

[12] Teens stumble on a new class of astronomical object. Nature, 560(7719):413, 2018.

[13] Silvertown, J. (2009). A new dawn for citizen science. Trends in ecology & evolution, 24(9), 467-471.

27 [14] Anderson, D. P., Cobb, J., Korpela, E., Lebofsky, M., & Werthimer, D. (2002). SETI@ home: an experiment in public-resource computing. Communications of the ACM, 45(11), 56-61.

[15] Reichhardt, T. (1999). A million volunteers join the online search for extraterrestrial life. Nature 400.

[16] Jeong, S. J., Medvigy, D., Shevliakova, E., & Malyshev, S. (2013). Pre- dicting changes in temperate forest budburst using continental-scale ob- servations and models. Geophysical Research Letters, 40(2), 359-364.

[17] Silvertown, J., Cook, L., Cameron, R., Dodd, M., McConway, K., Worthington, J., et al. (2011). Citizen science reveals unexpected continental-scale evolutionary change in a model organism. PLoS One, 6(4), e18927.

[18] Knispel, B., Allen, B., Cordes, J. M., Deneva, J. S., Anderson, D., Aulbert, C., et al. (2010). Pulsar discovery by global volunteer comput- ing. Science, 329(5997), 1305-1305.

[19] Trumbull, D. J., Bonney, R., Bascom, D., & Cabral, A. (2000). Thinking scientifically during participation in a citizen-science project. Science education, 84(2), 265-275.

[20] Haklay, M. (2013). Citizen science and volunteered geographic informa- tion: Overview and typology of participation. In Crowdsourcing geo- graphic knowledge (pp. 105-122). Springer Netherlands.

[21] Lowry, C. S., & Fienen, M. N. (2013). CrowdHydrology: crowdsourcing hydrologic data and engaging citizen scientists. Ground Water, 51(1), 151-156.

[22] Marshall, P. J., Lintott, C. J., & Fletcher, L. N. (2015). Ideas for citizen science in astronomy. Annual Review of Astronomy and Astrophysics, 53, 247-278.

[23] Cox, J., Oh, E. Y., Simmons, B., Lintott, C., Masters, K., Greenhill, A., ... & Holmes, K. (2015). Defining and measuring success in online citizen science: A case study of Zooniverse projects. Computing in Science & Engineering, 17(4), 28-41.

28 [24] Simpson, R., Page, K. R., & De Roure, D. (2014, April). Zooniverse: observing the world’s largest citizen science platform. In Proceedings of the 23rd international conference on world wide web (pp. 1049-1054). ACM.

[25] A. Klesman, Zooniverse: A citizen science success story. Astronomy Magazine, October 2018.

[26] Simpson, R. J., Povich, M. S., Kendrew, S., Lintott, C. J., Bressert, E., Arvidsson, K., ... & Smith, A. M. (2012). The project first data release: a bubblier galactic disc. Monthly Notices of the Royal Astronomical Society, 424(4), 2442-2460.

[27] Simmons, B. D., Lintott, C., Willett, K. W., Masters, K. L., Kartaltepe, J. S., Huler, B., ... & Smethurst, R. J. (2016). : Quantitative visual morphological classifications for 48,000 galaxies from CANDELS. Monthly Notices of the Royal Astronomical Society, stw2587.

[28] Christiansen, J. L., Crossfield, I. J., Barentsen, G., Lintott, C. J., Bar- clay, T., Simmons, B. D., ... & Allen, C. (2018). The K2-138 System: A Near-resonant Chain of Five Sub-Neptune Planets Discovered by Citizen Scientists. The Astronomical Journal, 155(2), 57.

[29] Kuchner, M. J., Faherty, J. K., Schneider, A. C., Meisner, A. M., Filip- pazzo, J. C., Gagn, J., ... & Mokaev, K. (2017). The first discovered by the : Planet 9 citizen science project. The Astrophysical Journal Letters, 841(2), L19.

[30] Raddick, M. J., Thakar, A. R., Szalay, A. S., & Santos, R. D. (2014). Ten years of SkyServer II: How astronomers and the public have embraced e-science. Computing in Science & Engineering, 16(4), 32-40.

[31] Szalay, A. S. (2018). From SkyServer to SciServer. The ANNALS of the American Academy of Political and Social Science, 675(1), 202-220.

[32] Medvedev, D., Lemson, G., & Rippin, M. (2016, July). SciServer com- pute: Bringing analysis close to the data. In Proceedings of the 28th International Conference on Scientific and Statistical Database Man- agement (p. 27). ACM.

29 [33] Kluyver, T., Ragan-Kelley, B., Prez, F., Granger, B. E., Bussonnier, M., Frederic, J., ... & Ivanov, P. (2016, May). Jupyter Notebooks-a publish- ing format for reproducible computational workflows. In ELPUB (pp. 87-90). Proceedings of the 20th International Conference on Electronic Publishing

[34] Kacsuk P (ed), Science Gateways for Distributed Computing Infrastruc- tures, ISBN 978-3-319-11268-8, 2014.

[35] Galizia, A., Roverelli, L., Zereik, G., Danovaro, E., Clematis, A., & DAgostino, D. (2019). Using Apache Airavata and EasyGateway for the creation of complex science gateway front-end. Future Generation Computer Systems, V. 94, pp. 910-919.

[36] K.A. Lawrence, M. Zentner, N. Wilkins-Diehr, J.A. Wernert, M. Pierce, S. Marru, S. Michael, Science gateways today and tomorrow: Positive perspectives of nearly 5,000 members of the research community. Con- currency and Computation: Practice and Experience , Vol. 27, Iss. 16, pp. 4252-4268, 2015.

[37] DAgostino, D., Danovaro, E., Clematis, A., Roverelli, L., Zereik, G., & Galizia, A. (2016). From lesson learned to the refactoring of the DRIHM science gateway for hydro-meteorological research. Journal of Grid Com- puting, 14(4), 575-588.

[38] U. Becciani, E. Sciacca, A. Costa, P. Massimino, C. Pistagna, S. Riggi, F. Vitello, C. Petta, M . Bandieramonte, M. Krokos, Science gateway technologies for the astrophysics community, Concurrency and Compu- tation: Practice and Experience, 27, pp. 306-327, 2015.

[39] A. Costa, P. Massimino, M. Bandieramonte, U. Becciani, M. Krokos, C. Pistagna, S. Riggi, E. Sciacca, F. Vitello, An Innovative Science Gateway for the Cherenkov Telescope Array. J Grid Computing vol. 13, issue 4, pp. 547-559, 2015.

[40] S. Sanchez-Exposito, P. Martin, J. E. Ruiz, L. Verdes-Montenegro, J. Garrido, R. Sirvent, A. Ruiz Falco, R. M. Badia, Web Services as Build- ing Blocks for Science Gateways in Astrophysics. J Grid Computing vol. 14, issue 4, pp. 673-685, 2016.

30 [41] A. Galizia, G. Zereik, L. Roverelli, E. Danovaro, A. Clematis, and D. D’Agostino, Json-GUI - a module for the dynamic generation of form- based web interfaces. SoftwareX 9, 28-34, 2019.

[42] D. C. H. Wallom, M. Turilli, M. Drescher, D. Scardaci and S. Newhouse, Federating Infrastructure as a Service Cloud Computing Systems to Cre- ate a Uniform E-infrastructure for Research, Proceedings of the IEEE 11th International Conference on e-Science, pp. 155-164, 2015.

[43] EUGridPMA, Guideline on IGTF Approved Robots, OID 1.2.840.113612.5.4.1.1.1.6, 2014. http://goo.gl/p42E1X

[44] R. Barbera, G. Andronico, G. Donvito, A. Falzone, J. J. Keijser, G. La Rocca, L. Milanesi, G. P. Maggi andS. Vicario A grid portal with robot certificates for bioinformatics phylogenetic analyses Concurrency and Computation: Practice and Experience Volume 23, Issue 3, pages 246-255, 2011

[45] Valeria Ardizzone, Roberto Barbera, Antonio Calanducci, Marco Far- getta, E. Ingr, Ivan Porro, Giuseppe La Rocca, Salvatore Monforte, R. Ricceri, Riccardo Rotondo, Diego Scardaci, Andrea Schenone: The DECIDE Science Gateway. Journal of Grid Computing 10(4): 689-707 (2012)

[46] EGI-EXTraS Service Level Agreement. http://goo.gl/TqUm9Y

[47] Saxton, R. D., Read, A. M., Esquej, P., Freyberg, M. J., Altieri, B., & Bermejo, D. (2008). The first XMM-Newton slew survey catalogue: XMMSL1. Astronomy & Astrophysics, 480(2), 611-622.

[48] Rosen, S. R., Webb, N. A., Watson, M. G., Ballet, J., Barret, D., Braito, V., ... & Denkinson, G. (2016). The XMM-Newton serendipitous survey- VII. The third XMM-Newton serendipitous source catalogue. Astron- omy & Astrophysics, 590, A1.

[49] Board, S. S., National Research Council, & Astronomy and Astrophysics Survey Committee. (2002). Astronomy and Astrophysics in the New Millennium: Panel Reports. National Academies Press.

[50] Hanisch, R. J., Berriman, G. B., Lazio, T. J. W., Bunn, S. E., Evans, J., McGlynn, T. A., & Plante, R. (2015). The Virtual Astronomical

31 Observatory: Re-engineering access to astronomical data. Astronomy and Computing, 11, 190-209.

[51] Law-Green J. D., EXTraS Deliverable D8.12 - EXTraS IVOA Data Model, 2017. https://bit.ly/2F2x9q7

[52] Graham, M. et al. (2013), Utypes: current usages and practices in the IVOA, IVOA Note 13 Feb 2013, http://www.ivoa.net/documents/Notes/UTypesUsage/

[53] NASA/Science Office of Standards and Technology, Definition of the Flexible Image Transport System (FITS), NOST 100-2.0, NASA/NOST, available at: http://archive.stsci.edu/fits/fits standard/

[54] International Virtual Observatory Alliance, VOTable Format Definition ver. 1.4, available at http://www.ivoa.net/documents/VOTable/

[55] Wenger, M., Ochsenbein, F., Egret, D., Dubois, P., Bonnarel, F., Borde, S., ... & Monier, R. (2000). The SIMBAD astronomical database-The CDS reference database for astronomical objects. Astronomy and As- trophysics Supplement Series, 143(1), 9-22.

[56] Heenatigala, T., Russo, P., Strubbe, L., & Gomez, E. (2015). IAU as- troEDU: an open-access platform for peer-reviewed astronomy education activities. IAU General Assembly, 22.

[57] Mereghetti, S., De Luca, A., Salvetti, D., Belfiore, A., Marelli, M., Paizis, A., ... & Tiengo, A. (2018). EXTraS discovery of a peculiar flar- ing X-ray source in the Galactic globular cluster NGC 6540. Astronomy & Astrophysics, 616, A36.

32