Kurs im Selbststudium

In-Memory Data Management

Angeboten von Prof. Hasso Plattner
In-Memory Data Management

The „In-Memory Data Management" MOOC in 2017 is the fifth iteration of Prof. Hasso Plattner's successful introduction into the inner mechanics of this recent technology. It is a repetition of the 2015 course and builds on the same, revised material. The course focuses on the management of enterprise data in column-oriented in-memory databases. Latest hardware and software trends led to the development of a new revolutionary database technology that enables flexible and lightning-fast analysis of massive amounts of enterprise data.

Seit 18. November 2017 im Selbststudium
Kurssprache: English
中文, Čeština, български, Русский, Português, Polski, Nederlands, Italiano, Magyar, English, Ελληνικά, Deutsch
Big Data and AI, Enterprise Computing, Expert

Kursinformationen

The „In-Memory Data Management" MOOC in 2017 is the fifth iteration of Prof. Hasso Plattner's successful introduction into the inner mechanics of this recent technology. It is a repetition of the 2015 course and builds on the same, revised material. The course focuses on the management of enterprise data in column-oriented in-memory databases. Latest hardware and software trends led to the development of a new revolutionary database technology that enables flexible and lightning-fast analysis of massive amounts of enterprise data. The basic concepts and design principles of this technology are explained in detail. Beyond that, the implications of the underlying design principles for future enterprise applications and their development are discussed. The MOOC will explain in detail the differences and advantages of an in-memory column-oriented database in contrast to traditional row-oriented disk-based storages.

        • Requirements for Modern Enterprise Computing, Enterprise Application Characteristics
        • Hardware Trends, Columnar Storage vs. Row Storage
        • Dictionary Encoding, Compression
        • Scans, Selects, Deletes, Inserts, Updates
        • Indices, Joins
        • Applications

Participants of this course will receive access to the extensive reading material accompanying the lectures.

Lernmaterial

  • Week 1:

    The first week will give you an understanding of origins of enterprise computing. It is vital to know the historic development which lead to the emergence of current hardware as we know it now in order to understand the decisions made in the past. Many characteristics of current applications, like materialized aggregates and a reduction of detail in the stored information, have their roots in the past. While these measures were helpful in former systems, they form an obstacle which has to be overcome now in order to allow for new, dynamic applications.
  • Week 2:

    Within the second week, the differences between a horizontal, row oriented layout and a columnar layout are discussed. Concepts like compression and partitioning are introduced. Based on that, you will get an explanation of the internal steps performed inside the database to carry out the fundamental relational operations insert, update and delete. The week concludes with a fundamental difference of SanssouciDB to most other databases: the insert only approach. Following this concept, we circumvent several pitfalls concerning referential integrity and additionally gain the foundation for a gap-less time travel feature.
  • Week 3:

    The content of week 3 focuses on more advanced structures and operations within the database. The differential buffer, a means to prevent frequent resorting of the dictionaries and rewriting of the attribute vectors, is explained in further detail. Subsequently, also the merge process, which incorporates the changes from the differential buffer into the main store, is illustrated. The retrieval of information via the select statement, as well as related concepts like tuple reconstruction, early and late materialization, or a closer examination of the achieved scan speed, are also part of this week's schedule. The description of the join operation, which is used to connect information from different tables, concludes this week.
  • Week 4:

    Week 4 is all about aggregation. Aggregations are the centerpiece of every business analytics application. Given that huge impact of aggregates on all parts of a business, it is of great importance to understand what aggregate functions are, why we remove all materialized aggregates and go for aggregation on the fly. You will further learn how to greatly reduce the costs of this on demand approach by using the aggregate cache and understand its connection to the differential buffer and the merge process.
  • Week 5:

    Week 5 sheds light on some more inner mechanisms of the database. What happens in emergency situations, when for example the power is turned off? Logging and recovery are vital parts to know in order to understand why an in-memory database is as secure as a traditional disk based one. Further, the benefits of replicas are explained. We conclude the week with an outlook onto the implications that arise with the tremendously increased speed at hands.
  • Week 6:

    Week 6 will give some insights going further than the internal workings of a database. We will have a look into trending topics such as machine learning and cryptocurrencies and will relate these recent trends to databases and their applicability within (enterprise) computing.
  • Exam:

    The final exam will cover all content from the previous weeks and test your understanding of the course as a whole.

Für diesen Kurs einschreiben

Der Kurs ist kostenlos. Legen Sie sich einfach ein Benutzerkonto auf openHPI an und nehmen Sie am Kurs teil!
Jetzt einschreiben

Lernende

Aktuell
Heute
9.956
Kursende
18. November 2017
5.825
Kursstart
18. September 2017
4.683

Bewertungen

Der Kurs wurde mit durchschnittlich 4.84 Sternen bei 49 abgegebenen Stimmen bewertet.

Anforderungen für Leistungsnachweise

  • Das Zertifikat erhält, wer es gebucht hat und den Leistungsnachweis erhalten hat. Mehr Informationen finden Sie in den Richtlinien für das Zertifikat.
  • Den Leistungsnachweis erhält, wer in der Summe aller benoteten Aufgaben mindestens 50% der Höchstpunktzahl erreicht hat.
  • Die Teilnahmebestätigung erhält, wer auf mindestens 50% der Kursunterlagen zugegriffen hat.

Mehr Informationen finden Sie in den Richtlinien für Leistungsnachweise.

Dieser Kurs wird angeboten von

Prof. Hasso Plattner

Prof. Dr. h.c. Hasso Plattner is the chair of the "Enterprise Platform and Integration Concepts" research group at HPI. His research group focuses mainly on in-memory data management for enterprise applications and human-centered software design.

He is co-founder of SAP SE, where he served as the CEO until 2003 and has since been chairman of the supervisory board. SAP SE is the leading provider of enterprise software solutions. In his role as chief software advisor, he concentrates on defining the mid- and long-term technology strategy and direction of SAP.