Display language
To modulepage Generate PDF

#40310 / #5

SoSe 2020 - WiSe 2021/22

English

Advanced Information Management 2 - Management of Data Streams

6

Markl, Volker

benotet

Portfolioprüfung

Zugehörigkeit


Fakultät IV

Institut für Softwaretechnik und Theoretische Informatik

34351500 FG Datenbanksysteme und Informationsmanagement (DIMA)

No information

Kontakt


EN 7

Borusan, Alexander

sekr@dima.tu-berlin.de

Learning Outcomes

Through the technological advances in the last few years more and more applications are being created that constantly generate data which is only relevant for a certain time frame. Because of this, this type of application has to be able to handle various streams of data. You will gain conceptual, methodological and practical skills in the area of processing data streams, by using examples from various application areas.

Content

In recent years, advances in hardware technology have facilitated new ways of collecting data continuously. In many applications such as for instance network monitoring, the volume of such data is so large that it may be impossible to store the data on disk. Furthemore, even when the data can be stored, the volume of the incoming data may be so large that it may be impossible to process any particular record more than once. Therefore, many database operations and data analysis algorithms such as for instance filtering, indexing, classification and clustering become significantly more challenging in this context. The course has the following main topics: - Basic conceptual understanding and terminology of data streams management, introduction to data streams, the difference to classical data management, examples (telephone networks, automotive electronics, avionics, medical, transport management, building monitoring, etc.) - Basic concepts of technical information systems, modeling of data streams - Data sources, requirements elicitation, requirements structuring, requirements of data stream management systems (DSMS) - Reference architecture of a DSMS, architecture modeling - Modeling of the functionality, logical architecture. Description on technical architecture, interface definition, behavior modeling - Data streams processing: Windowing, The Sliding-Window Computation Model and Results - Synopsis Construction in Data Streams (Sampling, Wavelets, Sketches and Histograms) - Filtering, counting in data streams - Data streams analysis: Classification & Clustering - Data processing in sensor networks, resource utilization, transmission and transfer costs - Modeling examples (automotive electronics, avionics). Prototype Systems (Aurora, STREAM, TelegraphCQ). Frameworks (Flink, Spark, Storm, Samza, SAMOA)

Module Components

Pflichtteil:

All Courses are mandatory.

Course NameTypeNumberCycleLanguageSWSVZ
Advanced Information Management 2 - Management of Data StreamsIV0434 L 471WiSeEnglish4

Workload and Credit Points

Advanced Information Management 2 - Management of Data Streams (IV):

Workload descriptionMultiplierHoursTotal
Labs / Project Work15.04.0h60.0h
Lecture15.02.0h30.0h
Seminar Presentations + Reporting15.04.0h60.0h
Study of Materials15.02.0h30.0h
180.0h(~6 LP)
The Workload of the module sums up to 180.0 Hours. Therefore the module contains 6 Credits.

Description of Teaching and Learning Methods

AIM-2 (Advanced Information Management 2 - Management of Data Streams) consists of 4 teaching elements: lectures, in parallel integrated phases of student presentations (individual work) and labs work (group projects plus home work), plus intensive study of literature.

Requirements for participation and examination

Desirable prerequisites for participation in the courses:

BSc in Computer Science / Computer Engineering (or similar); good knowledge in 'Databases & Information Systems', 'Software Engineering', and Mathematics

Mandatory requirements for the module test application:

This module has no requirements.

Module completion

Grading

graded

Type of exam

Portfolio examination

Type of portfolio examination

100 Punkte insgesamt

Language

English

Test elements

NamePointsCategorieDuration/Extent
Project Work/Homework (small teams = 3 max)20practical30 hours
Seminar Talk (individual)30oral20-30 minutes
Written Project Report35written20 pages
Written Seminar Report (individual)15written8-10 pages

Grading scale

Notenschlüssel »Notenschlüssel 2: Fak IV (2)«

Gesamtpunktzahl1.01.31.72.02.32.73.03.33.74.0
100.0pt95.0pt90.0pt85.0pt80.0pt75.0pt70.0pt65.0pt60.0pt55.0pt50.0pt

Test description (Module completion)

The exam of this module consists of 4 portfolio elements: 3 'deliverable assessements' ("Ergebnisprüfungen"): seminar talk, seminar report, and project report plus one 'learning process review' ("Lernprozess-Evaluierung"): project (lab & home) work evaluation

Duration of the Module

The following number of semesters is estimated for taking and completing the module:
1 Semester.

This module may be commenced in the following semesters:
Wintersemester.

Maximum Number of Participants

The maximum capacity of students is 30.

Registration Procedures

Before semester start, you must announce your interest to the "DIMA Anmeldetool" ('DIMA pre-semester registration tool')! Within 6 weeks after commencement of the class (but before the first result is to be delivered), you must offiicially register in QISPOS (TUB examination protocol system')

Recommended reading, Lecture notes

Lecture notes

Availability:  unavailable

 

Electronical lecture notes

Availability:  unavailable

 

Literature

Recommended literature
Data Streams: Models and Algorithms. Ed. by Charu C. Aggarwal, Springer, 2007 als Basisliteratur, daneben zu jedem Themenkomplex klassische und aktuelle Forschungspapiere.
J. Leskovec, A.Rajaraman, J.D.Ullman. Mining of Massive Data Sets. Cambridge University Press. 2014, ISBN: 9781107077232

Assigned Degree Programs


This module is used in the following Degree Programs (new System):

Studiengang / StuPOStuPOsVerwendungenErste VerwendungLetzte Verwendung
This module is not used in any degree program.

Miscellaneous

No information