COURSE OBJECTIVES AND OUTCOMES
The course participants will learn how to collect and manage data about their everyday life to become researchers of themselves. The general objective is to allow the participants to gain a practical understanding of how data about their everyday life are related to AI. The participants have the opportunity to explore both technical questions about data quality and data generation, and ethical and privacy aspects. The course covers all the phases of the data life cycle: collection, preparation, documentation and distribution. After a brief theoretical introduction, the participants will collect behavioural data about themselves using a mobile application (available for Android and iOS) called iLog and developed by the Knowdive group at the University of Trento. The app collects phone sensor data and sends questions to the participants at regular intervals. Then, they will prepare and document their data. The course is data-intensive and hands on. The participants are free to decide whether to collect their own data or to work on data collected from previous studies. However, the collection and management of one’s own data is strongly suggested to fully understand and become aware of the challenges and the potential of handling one’s own personal data. The overall objective of this course is to familiarise with the management of personal data and to raise awareness about the value and the impact of one own’s data.
COURSE DESCRIPTION
This course will cover the following topics:
- Why and how to collect personal data
- What type of data can be collected with mobile devices
- How to prepare the data to reduce the cost of data use and reuse 3
- How to document and distribute data
COURSE MODALITY
The Fall 2024 Edition of the course is delivered in the classrooms. The idea behind this course is that the students do most of the work during the course duration because this should yield considerably better results.
Learning how to handle one’s own data involves working on real personal data. Data collection is then a crucial part of the whole process. For this reason, the students of the course will be taught and asked to collect data from their own smartphones via an application developed by the Knowdive group, ideally over around one month. The calendar shows the dates of the data collection and when the data will be made available (dates can still change). Participants unwilling to collect their personal data will work on data collected in past data collections. More details will be provided by the professors during the first lessons.
Theory:
- The lessons will cover the basic aspects of data collection, data management and data distribution, mainly from a computer science perspective but also with some notions from social science.
Practice:
- Participants will collect and process their own data. During the course, they will be guided through the various phases and asked to work on their data at home. Multiple lessons are allocated to answer questions and address issues about the project.
Modality:
- The project work will run in parallel with the theory lessons. After the completion of a theory module, there is a practical application to one own’s data. During each theory lesson, there will be Q&A sessions about its content, and some lessons will be fully allocated to Q&A related to previous content and project progress.
EXAM
The exam will consist of presenting the results of the study in a short presentation during the last lesson. The required deliverables (templates to be provided) are:
- pseudonymized, cleaned and prepared dataset (only for evaluation purposes and removed from the course storage after the evaluation process);
- metadata catalog with the metadata of the pseudonymized dataset (catalog example). The catalog is a static webpage hosted by the participant and can be private or public. If the visualization is set to private, remember to give permission to the professor to access the webpage.
- static website describing the data collection and the data preparation
CALENDAR AND MATERIAL
The course runs from September 9 2024 till November 26, 2024 with the following schedule
- Monday, 10:30-13:30, Room A216 Povo 1
- Tuesday, 14:30-16:30, Room A216 Povo 1
Course Features
- Lecture 0
- Quiz 0
- Duration 48 hours
- Skill level Beginner
- Language English
- Students 273
- Assessments Yes
Requirements
- Because of the practical parts, a basic knowledge of Python programming is strongly suggested.
- Basic knowledge of data science, ethics, and data governance is useful but not mandatory
Target audiences
- The course is taught as a module of the Studies on Human Behaviour course, part of the Data Science degree at the University of Trento.