GENERAL DESCRIPTION
This course will cover the following topics:
- the main issues which can be addressed when data and knowledge resources have to be integrated.
- a general methodology (iTelos) for knowledge and data analysis, modeling and integration.
- an analysis of the state of the art tools and methodologies for data analysis, modeling and integration.
- an introduction to ontologies, Extended ER models and linguistic resources.
This is a hands-on, lab and experiment based course. Students will be given a data analysis/modelling/integration problem that they will have to solve, possibly, while taking the class. During the experiment, students will have to apply to the problem the notions introduced in class. The students splitted in teams, where each team will solve an integration problem adopting the methodology taught during the lectures.
INSTRUCTION
This KDI course is the same course taught at the University of Trento in the academic year of 2021/2022. This course is taught in the presence in class of both lecturers and students, primarily as a hands-on lab course. Passing the exam amounts to developing a project, which ultimately will lead to generating a Knowledge Graph (and support documentation) starting from data that will have to be found on the Web. This goal will be reached under the continuous supervision of the lectures providing advice and support, and in collaboration, doing joint work with a colleague taking this course. There are no easy or cost-effective ways to achieve this goal without a continuous presence in class. Given the current Covid situation, if someone taking the course cannot be in class for a particular lecture, that lecture will be registered and made available to him/ her. The registration request should be done as soon as possible before the beginning of the lecture (ideally 2-3 days before) and supported by a valid justification. Notice, however, that these registered lectures will unlikely have the same quality as the physical lecture, particularly for classes that will consist of one-to-one interactions between the students and the lecturers. In most cases, additional interaction with the lecturers during the Q&A lectures will achieve a better goal (see below).
The lectures will follow the scheduling indicated in the Calendar and Material section. The course material includes slides, demo videos, support resources, and links, and all are provided on the website under the Calendar and Material section. For those interested, it is possible to consult the registrations of the KDI Trento lectures of the A.Y. 2020/2021 (here). This might be occasionally useful but with the following two points of attention: (1) while being very similar in spirit to the last year, the course this year presents some substantial differences, all exploiting that the lectures are in presence, and (2) in most cases we suggest you ask the lecturers for feedback or suggestions. In particular, to help students, after each phase of the methodology taught in the KDI course, there will be a Q&A lecture in which the students can ask questions about all their open problems and doubts.
At the end of the course, students will be asked to complete an online questionnaire about the overall process and methodology they will have learned. This feedback is essential to us as it is the basis for continuous evolution and improvement of the methodology being taught. To this extent, students are strongly encouraged to raise doubts, ask questions, and discuss their doubts about the methodology during the Q&A lectures.
LEARNING OUTCOMES
The Knowledge and Data Integration course aims to providing motivations, definitions, theorems and techniques for a concrete and effective understanding of what (in the context of computer science) is meant for knowledge and data integration. Providing also, techniques for analyzing and modelling knowledge and data as well as techniques for data and knowledge integration. Stimulating the students to continue their career with higher interest into data and knowledge representation in their own field of expertise, and to produce computer-processable solutions of relevant problems.
CALENDAR AND MATERIAL
The course runs from Sep, 6, 2022 till Dec 20, 2022 with the following schedule
- Tuesday, 16:00-16:45, Room 3A-103
- Tuesday, 16:45-18:25, Room 3A-103
Course Features
- Lecture 0
- Quiz 0
- Duration 56 hours
- Skill level All levels
- Language English
- Students 273
- Assessments Yes
Requirements
- Not formal pre-requisites but the following will help a lot:
- Data management: basic programming skills in python and/or java/javascript
- Databases modeling: ER modeling, (Ontology modeling if possible, Ontology definition desirable)
- Attitude to teamwork
Target audiences
- The intended target are the graduate or undergraduate students of the Department of Information and Computer Science, School of Information Technology and Electornics (SITE) of the National University of Mongolia.