Display language
To modulepage Generate PDF

#40804 / #5

SoSe 2023 - WiSe 2023/24

English

DMH Data Management on Modern Hardware

6

Markl, Volker

benotet

Portfolioprüfung

Zugehörigkeit


Fakultät IV

Institut für Softwaretechnik und Theoretische Informatik

34351500 FG Datenbanksysteme und Informationsmanagement (DIMA)

No information

Kontakt


EN 7

Zeuch, Steffen

sekr@tu-berlin.de

Learning Outcomes

Traditionally, database systems managed databases that were primarily stored on secondary storage and only a small part of the data could fit in main memory. Therefore, disk IO was the dominating cost factor. Nowadays, it is possible to equip servers with several terabytes of main memory, which allows us to keep databases in main memory to avoid the disk IO bottleneck. Therefore, the performance of database systems became limited by memory access and processing power. This course will teach students the fundamentals of efficient data processing in main-memory database systems using techniques optimized for main memory (e.g., column stores and query compilation) and modern processor capabilities (e.g., SIMD-based database algorithms, GPU co-processing).

Content

The course is split into two parts, each covering roughly one half of the semester. During the first part, the students learn the fundamentals of cache-efficient storage and processing models. This includes columnar storage and query processing, compression, vector-at-a-time processing, query compilation and transaction processing. In the second part, students learn the basics of parallel data processing on modern CPUs and co-processors (e.g., GPUs) for typical database operators, including optimizations such as SIMD and NUMA-awareness. The course consists of a lecture and theoretical, written exercises.

Module Components

Pflichtgruppe:

All Courses are mandatory.

Course NameTypeNumberCycleLanguageSWSVZ
In-Memory Databases On Modern HardwareIV3435 L 9195SoSeNo information4

Workload and Credit Points

In-Memory Databases On Modern Hardware (IV):

Workload descriptionMultiplierHoursTotal
Participating in Meetings15.04.0h60.0h
60.0h(~2 LP)

Course-independent workload:

Workload descriptionMultiplierHoursTotal
Exam preparation1.030.0h30.0h
Graded problem sheets1.060.0h60.0h
Self-assessment Database Technology15.02.0h30.0h
120.0h(~4 LP)
The Workload of the module sums up to 180.0 Hours. Therefore the module contains 6 Credits.

Description of Teaching and Learning Methods

Lectures are accompanied by individual exercises to practically rehearse the theory taught in the lectures. The course will be given in English.

Requirements for participation and examination

Desirable prerequisites for participation in the courses:

Desirable prerequisites for participation in the courses: This course is an advanced course for master’s students with focus on database systems and information management. In contrast to the introduction of database systems (ISDA Informationssysteme und Datenanalyse), which looks at database systems from an application programmer’s point of view, this class focuses on the data management systems and various optimizations for efficient query processing on modern hardware. It is desirable for students to have completed the Database Technology (DBT) prior to enrolling in DMH. To participate, students are required to have successfully completed a Bachelor’s degree in computer science with a focus on database systems (e.g., DBPRA Datenbankpraktikum and DBPRO Datenbankprojekt). Knowledge of data modeling, relational algebra, and SQL as well as a very good command of Java, or possibly C/C++/C#, programming is required to participate in the course.

Mandatory requirements for the module test application:

This module has no requirements.

Module completion

Grading

graded

Type of exam

Portfolio examination

Type of portfolio examination

100 Punkte insgesamt

Language

German

Test elements

NamePointsCategorieDuration/Extent
(Deliverable Assessment) Homework Exercises20written4 x15h = 60h
(Examination) Quiz 1: (Mid term)40written60 min
(Examination) Quiz 2: (End-of-term)40written60 min

Grading scale

Notenschlüssel »Notenschlüssel 2: Fak IV (2)«

Gesamtpunktzahl1.01.31.72.02.32.73.03.33.74.0
100.0pt95.0pt90.0pt85.0pt80.0pt75.0pt70.0pt65.0pt60.0pt55.0pt50.0pt

Test description (Module completion)

No information

Duration of the Module

The following number of semesters is estimated for taking and completing the module:
1 Semester.

This module may be commenced in the following semesters:
Sommersemester.

Maximum Number of Participants

The maximum capacity of students is 30.

Registration Procedures

Students are required to register for the course in the official TUB examination system within six weeks after commencement of the first lecture or when the first graded assignment is due, whichever happens to be first.

Recommended reading, Lecture notes

Lecture notes

Availability:  unavailable

 

Electronical lecture notes

Availability:  unavailable

 

Literature

Recommended literature
Alfons Kemper, André Eickler Datenbanksysteme. Eine Einführung.10., aktualisierte und erweiterte Auflage, Oldenbourg Verlag, 2015.
Daniel Abadi, Peter A. Boncz, Stavros Harizopoulos, Stratos Idreos, Samuel Madden: The Design and Implementation of Modern Column-Oriented Database Systems. Foundations and Trends in Databases 5(3): 197-280 (2013)
Hasso Plattner. 2014. A Course in In-Memory Data Management: The Inner Mechanics of In-Memory Databases. Second Edition. Springer Publishing Company, Incorporated.
John L. Hennessy, and David A. Patterson. Computer architecture: a quantitative approach. Elsevier, 2012.

Assigned Degree Programs


This module is used in the following Degree Programs (new System):

Studiengang / StuPOStuPOsVerwendungenErste VerwendungLetzte Verwendung
This module is not used in any degree program.

Miscellaneous

No information