GEO 2016 transitional Work Program


Research Data Science Summer Schools

Activity ID: 32


The ever-accelerating volume and variety of data being generated is having a huge impact on a wide variety of research disciplines, from the sciences to the humanities: the international, collective ability to create, share and analyse vast quantities of data is having a profound, transformative effect. What can justly be called the ‘Data Revolution’ offers many opportunities coupled with significant challenges. Prominent among these is the need to develop the necessary professions and skills. There is a recognised need for individuals with the combination of skills necessary to optimise use of the new data sets. Such individuals may have a variety of different titles: Data Scientist, Data Engineer, Data Analyst, Data Visualiser, Data Curator. All of them are essential in making the most of the data generated.
Contemporary research – particularly when addressing the most significant, transdisciplinary research challenges – cannot effectively be done without a range of skills relating to data. These include the principles and practice of Open Science and research data management and curation, the use of a range of data platforms and infrastructures, large scale analysis, statistics, visualisation and modelling techniques, and software development and annotation, among others. We define the ensemble of these skills as ‘Research Data Science’.

Modern Research Data skills are common to all disciplines, and training in ‘Research Data Science’ needs to take this into account. For example, all disciplines need to ensure that research is reproducible and that provenance is documented reliably; this requires a transformation in practice and the promotion of ‘Research Data Science’ skills.

It is a strategic priority for both CODATA and the Research Data Alliance to build capacity and to develop skills, training young researchers in the principles of Research Data Science. Particular attention is paid to the needs of young researchers in low and middle income countries (LMICs). It is important that Open Data and Open Science benefit research in LMICs and that the unequal ability to exploit these developments does not become another lamentable aspect of the ‘digital divide’. On the contrary, it has been argued that the ‘Data Revolution’ provides a notable opportunity for reducing that divide in a number of respects.
This activity relates most specifically to the GEO Strategic Objective of ‘Engage’ and the ‘Capacity Building’ activity therein.  The promotion and development of data science skills, as described here, is an important component of capacity building and essential to the greater use and reuse of earth observation data to meet Societal Benefit Areas.

The vision for the schools is a series of data science short courses that use a quality-assured set of reusable material, are supported by online delivery, and are quality controlled and accredited by an appropriate body or bodies so that they can count towards students’ postgraduate qualifications. The CODATA-RDA Working Group is seeking to put the mechanisms for these important features in place.

The CODATA-RDA Research Data Science Summer Schools will:

  • address a recognised need for Research Data Science skills across disciplines;
  • follow an accredited curriculum;
  • provide a pathway from a broad introductory course for all researchers (Vanilla) through more advanced and specialised courses (Flavours and Toppings);
  • be reproducible: all materials will be online with Open licences;
  • be scalable: emphasis will be placed on Training New Teachers (TNT) and building sustainable partnerships.

Leads and Contributors

CODATA and RDA. For all the schools, the CODATA-RDA Working Group is collaborating with a wide range of partners and, to the greatest degree possible, re-using available materials.

2016 Activities

Vanilla School: The first school, named ‘Vanilla’ by analogy to the most basic flavour of ice cream, will provide a bedrock of introductory material, common to all research disciplines, upon which more advanced schools can build. This school is designed to run for up to two weeks; for what the participants will gain, see the Reference Document. The programme will be run in partnership with the Software and Data Carpentry communities and the UK’s Digital Curation Centre. Other partnerships are being explored. The first full Vanilla School will take place on 1-12 August 2016 at the International Centre for Theoretical Physics, Trieste.

Flavoured Schools: Schools following Vanilla will be more advanced and specialised, refined as required to the ‘Research Data Science’ needs of particular disciplines. Such ‘flavoured’ schools, which will run for one or two weeks, will give students more specialised knowledge of Data Science as it is applied in a specific disciplinary research context. A flavoured school will not necessarily run directly after a Vanilla school and may be held in a completely different location.

Discussions are ongoing on schools on:

The first more advanced Flavoured School is likely to take place over one week in the Spring of 2016 at the University of Cape Town.

2016 Resources

  • The first full introductory or Vanilla course will take place from 1 to 12 August 2016 at the Abdus Salam International Centre for Theoretical Physics (ICTP) in Trieste, Italy. As host, and following its general practice, the ICTP will provide accommodation and subsistence for up to 120 students. The ICTP has committed 15K euros, TWAS 10K euros and CODATA at least 5K euros to support student travel. The current funding from ICTP, TWAS and CODATA will be prioritized for participants from LMICs. The Working Group is looking for additional support from partner organizations, funders and sponsors. Thanks to the hosting support, funds will be used entirely for student and instructor travel.
  • Resources for Flavoured Schools will be confirmed with the confirmation of the schools.
  • Additional activities for 2016 (recommended if additional resources are made available)

Future Plans

The Working Group is liaising with a number of partners to host schools in future years. The initiative builds on events held by CODATA in Beijing, Nairobi and Bangalore. As well as the various organisations mentioned, the WG is exploring whether the regional offices of the International Council for Science and The World Academy of Sciences can host schools from 2017.
Strong emphasis will be placed on Training New Teachers. Specific components and accreditation will be established for participants wishing to instruct on and lead future schools.




Contact Us


phone: +41 22 730 8505

fax: +41 22 730 8520


7 bis, avenue de la Paix
Case postale 2300
CH-1211 Geneva 2
