Web Search and Data Mining
Course Description
Web Search and Data Mining (WiSDoM) is an area that aims at extracting knowledge from the largest source of information created by humans: The Web! Throughout this course we will see how this extracted knowledge can solve complex tasks with advanced Computer Vision, Natural Language and Information Retrieval algorithms. The main topics of this course are:
- Text and visual data representation
- Billion scale text and image search
- Visual question answering
- Multimodal conversational assistants
- Recommender systems
This course includes intensive hands-on laboratories where key CV, NLP and IR algorithms are examined.
Objectives
- Learn what is an information embedding.
- Learn the semantic associations between visual data, natural language data and user data.
- Learn how to relate user information needs to actionable data.
- Learn how to do a critical analysis of experimental results.
- Develop autonomous and creative problem solving skills.
Grading
- Exam (40%)
- Project (45% = 15% phase 1 + 20% phase 2 + 10% phase 3)
- Project originality (15%)
Schedule
- 09/mar/21 Introduction
- 16/mar/22 Web document categorization
- 23/mar/22 Word embeddings
- 30/mar/22 Transformer (encoder)
- 06/abr/22 Billion scale indexing
- 13/abr/22 Visual search
- 20/abr/22 Vision-and-language models
- 27/abr/22 ExpoFCT
- 04/mai/22 Multimodal conversational assistants
- 11/mai/22 Pre-trainning and fine-tunning
- 18/mai/22 Transformer (decoder)
- 25/mai/22 Recommender systems
- 01/jun/22 HuggingFace invited lecture
- 08/jun/22 Project discussion / Test
Tutorials
We suggest you to use the account in the lab cluster. However, if you would like to have your own setup, you can follow this guide:
Lecturers
Joao Magalhaes ([email protected] - remove the ‘x’ character to send an email)