I will teach how to organize, transform, analyse and visualize small and big data, as well as how to effectively communicate the outcomes of the workflow. This is a path starting at Edgar Frank Codd, passing through Hadley Wickham, and ending at Giorgia Lupi.

The course will be multi-task (learn, make, use, watch, glance, read, dig, listen; see more below) and multi-teacher (I will be assisted by other real and virtual teachers). Some background in programming and statistics is desirable.

Play

  1. Relational Databases
  2. Data Science
  3. Network Science
  4. Hierarchical Data
  5. Text Mining
  6. Real-time Data
  7. Communicate and collaborate

Task-tag legend

You will go through different tasks: learn, make, use, watch, glance, read, dig, listen. A legend is below:

Software

Books

Datasets

Data challenges

Data challenges have 3 components:

The following are examples of data challenges you are invited to try:

  1. Which are the winners and losers in the last Italian soccer Serie A league? (challenge)
  2. Which is the best team ever in Italian soccer? (challenge)
  3. Is there a first-mover advantage in chess? (challenge)
  4. Is child mortality decreasing over time? (challenge)
  5. Are low quality diamonds more expensive? (challenge)
  6. Are female dolphins more social than male dolphins? (markup, markdown)
  7. Which are the most powerful countries in the European natural gas market? (markup, markdown)
  8. Detect the most dangerous terrorists involved in the Madrid train bombing attack of 2004 (markup, markdown)
  9. Discover the most interdisciplinary and autarchic disciplines in science (markup, markdown)
  10. Detect communities in a Karate club friendship network (markup, markdown)
  11. Attack the resilience of the Madrid train bombing terror network (markup, markdown)

Workshops