Display language
To modulepage Generate PDF

#40405 / #2

SS 2014 - SS 2015

English

Database Internals & Scalable Data Processing

6

Markl, Volker

benotet

Portfolioprüfung

Zugehörigkeit


Fakultät IV

Institut für Softwaretechnik und Theoretische Informatik

34351500 FG Datenbanksysteme und Informationsmanagement (DIMA)

No information

Kontakt


EN 7

Zeuch, Steffen

sekr@dima.tu-berlin.de

Learning Outcomes

The global data volume is increasing dramatically each year. Understanding how to store, process and manage these huge amounts of data efficiently is a key requirement for software engineers and data analysts in the modern IT world. This course will teach students both the fundamentals of data processing in traditional single-node database systems and how to scale out these techniques to huge amounts of data in large-scale, distributed environments. Technical skills: 30%; Method skills: 30%; System skills: 30%; Social competence: 10%.

Content

The lecture is split into two parts, each covering roughly one half of the semester. During the first part, the students become acquainted with the fundamentals of query processing in traditional relational database systems. This includes the general architecture of a DBMS, file- & buffer management, query processing, indexing, metadata management, query optimization, locking, recovery and transaction management. In the second half of the lecture, the basics of parallel data processing will be covered with a focus on large-scale, distributed systems and “cloud computing”. Topics include parallel processing platforms like MapReduce, distributed data storage and retrieval – e.g., via DHTs –, techniques for distributed locking and transaction handling, multi-tenancy and software as a service. The course consists of a lecture and theoretical, written exercises. An additional practice project in conjunction with this lecture is offered and recommended to be taken in parallel.

Module Components

Pflichtteil:

All Courses are mandatory.

Course NameTypeNumberCycleLanguageSWSVZ
IDB: Implementation of Database Systems and Scalable Data ProcessingVL0434 L 431WiSeNo information2
IDB: Implementation of Database Systems and Scalable Data ProcessingUE0434 L 431WiSeNo information2

Workload and Credit Points

IDB: Implementation of Database Systems and Scalable Data Processing (VL):

Workload descriptionMultiplierHoursTotal
Participation in class IDB15.02.0h30.0h
30.0h(~1 LP)

IDB: Implementation of Database Systems and Scalable Data Processing (UE):

Workload descriptionMultiplierHoursTotal
Participation in exercises IDB15.02.0h30.0h
30.0h(~1 LP)

Course-independent workload:

Workload descriptionMultiplierHoursTotal
Exam preparation1.030.0h30.0h
Graded problem sheets1.060.0h60.0h
Preparation/Consolidation IDB15.02.0h30.0h
120.0h(~4 LP)
The Workload of the module sums up to 180.0 Hours. Therefore the module contains 6 Credits.

Description of Teaching and Learning Methods

Lectures are accompanied by exercises in small groups to practically rehearse the theory taught in the lectures. The course will be given in English.

Requirements for participation and examination

Desirable prerequisites for participation in the courses:

This course is the base course for master students with focus on database systems and information management and should be attended in the first semester of the master program. In contrast to the introduction of database systems (MPGI5/DBS), which looks at database systems from an application programmers point of view, this class focuses on the internals of database systems. To participate, students are required to have successfully completed a Bachelor in computer science with a focus on database systems (participation in the Datenbankpraktikum, Datenbankprojekt). Knowledge of data modeling, relational algebra, and SQL as well as a very good command of Java, or possibly C/C++/C#, programming is required to participate in the course. Due to capacity reasons, the class is limited to at most 60 participants.

Mandatory requirements for the module test application:

This module has no requirements.

Module completion

Grading

graded

Type of exam

Portfolio examination

Type of portfolio examination

No information

Language

English

Test elements

NamePoints/WeightCategorieDuration/Extent
Assessment of the homework exercises20No informationNo information
Result of the written exam80No informationNo information

Grading scale

No information

Test description (Module completion)

No information

Duration of the Module

The following number of semesters is estimated for taking and completing the module:
1 Semester.

This module may be commenced in the following semesters:
Wintersemester.

Maximum Number of Participants

The maximum capacity of students is 60.

Registration Procedures

No information

Recommended reading, Lecture notes

Lecture notes

Availability:  unavailable

 

Electronical lecture notes

Availability:  unavailable

 

Literature

Recommended literature
[1] Hector Garcia-Molina, Jeffrey D. Ullman, Jennifer Widom: Database Systems - The Complete Book, Pearson Education International, 2002. Or:
[2] Garcia-Molina, Ullman, Widom: “Database Systems: The Complete Book,” Prentice Hall, 2000.
[3] R. Elmasri und S.B. Navathe: Fundamentals of Database Systems, Benjamin Cummings Deutsche Übersetzung: “Grundlagen von Datenbanksystemen,“ Pearson, 2002.
[4] J. Gray, A. Reuter: Transaction Processing, Morgan Kaufman, 1993.
[5] T. Özsu und P. Valduriez: Principles of Distributed Database Systems, Prentice Hall, 1999.
[6] Saake, Heuer, Sattler: “Datenbanken: Implementierungstechniken”, mitp Verlag, 2005 (2. Auflage).
[7] Härder, Rahm: “Datenbanksysteme. Konzepte und Techniken der Implementierung,” Springer, 2. Auflage 2001.
[8] Kemper, Eickler: “Datenbanksysteme – Eine Einführung,” Oldenburg, 5. Auflage 2004.

Assigned Degree Programs


This module is used in the following Degree Programs (new System):

Studiengang / StuPOStuPOsVerwendungenErste VerwendungLetzte Verwendung
This module is not used in any degree program.

Students of other degrees can participate in this module without capacity testing.

Miscellaneous

No information