Zur Modulseite PDF generieren

#41135 / #3

SoSe 2022 - SoSe 2023

Englisch

ROC Foundations for Graduate Research in Data Management and Machine Learning Systems

9

Markl, Volker

Benotet

Portfolioprüfung

Englisch

Zugehörigkeit


Fakultät IV

Institut für Softwaretechnik und Theoretische Informatik

34351500 FG Datenbanksysteme und Informationsmanagement (DIMA)

Keine Angabe

Kontakt


EN 7

Markl, Volker

sekr@dima.tu-berlin.de

Lernergebnisse

Big Data (BD) and Machine Learning (ML) are key drivers underlying the current wave of innovation in artificial intelligence and data science. Indeed, these drivers have had a profound impact on both the economy and the sciences. This course targets research-oriented students who aim to pursue a PhD in Big Data Management or Data Science and Engineering Systems and Technologies. Upon completion of this course, students will have learned about contemporary research methodology, gained both theoretical and practical skills in data management and big data technologies, and be attuned to today’s major research challenges in scalable data management and processing. The course is designed to principally impart technical skills (30%), method skills (40%), systems skills (10%), and social skills (20%).

Lehrinhalte

The central focus of this module is on contemporary research methodology (CRM), data management technologies, and current research challenges. After an initial presentation on CRM, including scientific reading, writing, and presenting, in subsequent lectures, students will read about foundational data management methods/technologies and offer a presentation, which will then be followed by an instructor led presentation addressing related advanced topics. Topics of discussion, include data storage and indexing, specification and compilation of data analysis programs, query optimization and self-tuning, adaptive methods, processing data science pipelines as well as responsible data management. The course will also include a lab component, where students analyze and evaluate discussed methods, technologies, and settings in a methodical and scientific way.

Modulbestandteile

Pflichtbereich

Die folgenden Veranstaltungen sind für das Modul obligatorisch:

LehrveranstaltungenArtNummerTurnusSpracheSWS ISIS VVZ
Research Oriented Course (ROC) on Data Science and Engineering Systems and TechnologiesIV0434 L 479WiSeen6

Arbeitsaufwand und Leistungspunkte

Research Oriented Course (ROC) on Data Science and Engineering Systems and Technologies (IV):

AufwandbeschreibungMultiplikatorStundenGesamt
Lab Course (Experimental Setup)15.02.0h30.0h
Lab Course (Performance Evaluation)15.04.0h60.0h
Lab Course (Programming)15.02.0h30.0h
Lab Course (System Setup)15.02.0h30.0h
Plenary Sessions15.04.0h60.0h
Preparation (including Reading, Literature Search, and Presentations)15.02.0h30.0h
Report15.02.0h30.0h
270.0h(~9 LP)
Der Aufwand des Moduls summiert sich zu 270.0 Stunden. Damit umfasst das Modul 9 Leistungspunkte.

Beschreibung der Lehr- und Lernformen

This Integrated Course (Integrierte Veranstaltung, IV) consists of: (i) lectures on key concepts, (ii) discussions, (iii) student lead presentations (including literature search), and (iv) a practical project element. Active participation and contributions to all parts of this course are essential.

Voraussetzungen für die Teilnahme / Prüfung

Wünschenswerte Voraussetzungen für die Teilnahme an den Lehrveranstaltungen:

Computer science topics addressed in TU Berlin modules in the Bachelor’s curriculum, particularly, both ISDA (Information Systems and Data Analysis) and DBPRA (Practical Database Systems Lab) or their equivalents, as well as good programming skills in C, Java, and SQL are all required. Additionally, an undergraduate course in linear algebra, probability, and statistics. This course will be offered in English. Thus, fluency in English is also required.

Verpflichtende Voraussetzungen für die Modulprüfungsanmeldung:

Dieses Modul hat keine Prüfungsvoraussetzungen.

Abschluss des Moduls

Benotung

Benotet

Prüfungsform

Portfolioprüfung

Art der Portfolioprüfung

100 Punkte insgesamt

Sprache(n)

Englisch

Prüfungselemente

NamePunkteKategorieDauer/Umfang
Experimentation Presentation20mündlich20min
Technology Presentation20mündlich20min
(Examination) Written Mid-term Test/Quiz30schriftlichKeine Angabe
(Deliverable Assessment) Final Report30schriftlichKeine Angabe

Notenschlüssel

Notenschlüssel »Notenschlüssel 2: Fak IV (2)«

Gesamtpunktzahl1.01.31.72.02.32.73.03.33.74.0
100.0pt95.0pt90.0pt85.0pt80.0pt75.0pt70.0pt65.0pt60.0pt55.0pt50.0pt

Prüfungsbeschreibung (Abschluss des Moduls)

The portfolio exam (worth 100 points) is comprised of three parts: (i) technology presentation (30 points), (ii) experimentation presentation (30 points), and (iii) a final report (40 points). The final grade will be computed according to the Grading Table 2 of Faculty IV, according to German law, § 47 (2) AllgStuPO TU Berlin.

Dauer des Moduls

Für Belegung und Abschluss des Moduls ist folgende Semesteranzahl veranschlagt:
1 Semester.

Dieses Modul kann in folgenden Semestern begonnen werden:
Wintersemester.

Maximale teilnehmende Personen

Die maximale Teilnehmerzahl beträgt 8.

Anmeldeformalitäten

Keine Angabe

Literaturhinweise, Skripte

Skript in Papierform

Verfügbarkeit:  nicht verfügbar

 

Skript in elektronischer Form

Verfügbarkeit:  nicht verfügbar

 

Literatur

Empfohlene Literatur
Readings in Database Systems, 5th Edition, Peter Bailis, Joseph M. Hellerstein, Michael Stonebraker, editors, http://www.redbook.io/
Various Research Papers, made available during the first lecture
Hadoop: The Definitive Guide (4th Edition), Tom White, O’Reilly Media, 2015.
Raj Jain: The Art of Computer Systems Performance Analysis: Techniques for Experimental Design, Measurement, Simulation, and Modeling (Wiley Professional Computing), 1991
Supplementary reading material may be assigned to complement course lectures.

Zugeordnete Studiengänge


Diese Modulversion wird in folgenden Studiengängen verwendet:

Studiengang / StuPOStuPOsVerwendungenErste VerwendungLetzte Verwendung
Dieses Modul findet in keinem Studiengang Verwendung.

Sonstiges

Keine Angabe