DATA MANAGEMENT SYSTEMS

Sabrina SENATORE DATA MANAGEMENT SYSTEMS

0622700106
DEPARTMENT OF INFORMATION AND ELECTRICAL ENGINEERING AND APPLIED MATHEMATICS
EQF7
COMPUTER ENGINEERING
2024/2025

OBBLIGATORIO
YEAR OF COURSE 2
YEAR OF DIDACTIC SYSTEM 2022
AUTUMN SEMESTER
CFUHOURSACTIVITY
648LESSONS
324EXERCISES
Objectives
THE COURSE EXPLORES TECHNOLOGIES AND METHODOLOGIES FOR MANAGING LARGE AMOUNTS OF DATA IN DISTRIBUTED ENVIRONMENTS CHARACTERIZED BY SCALABILITY AND AVAILABILITY ISSUES. METHODOLOGIES AND TOOLS FOR THE FORMAL DESCRIPTION, ANALYSIS AND VALIDATION OF BUSINESS PROCESSES AND THEIR USE IN THE DESIGN OF BUSINESS INFORMATION SYSTEMS ARE PRESENTED.

KNOWLEDGE AND UNDERSTANDING SKILLS.
ADVANCED KNOWLEDGE OF MANAGING LARGE DATA SETS IN CENTRALIZED AND DISTRIBUTED ENVIRONMENTS. NEW ARCHITECTURES FOR DBMS (NOSQL, NEWSQL), DATA MODELS (KEY-VALUE, COLUMN-ORIENTED, DOCUMENT-ORIENTED, GRAPH-ORIENTED, LINKED DATA). DATABASE DESIGN ISSUES IN DISTRIBUTED ENVIRONMENTS. SCALABILITY AND AVAILABILITY. FUNDAMENTALS OF BUSINESS PROCESS MANAGEMENT (BPM). DESIGN AND VALIDATION OF BP MODELS.

ABILITY TO APPLY KNOWLEDGE AND UNDERSTANDING.
ADVANCED USE AND CONFIGURATION OF NEXT-GENERATION DATABASE MANAGEMENT SYSTEMS. DESIGN AND MANAGE DATA DISTRIBUTION IN THE MANNER MOST CONGENIAL TO THE PARTICULAR APPLICATION CONTEXT, ENSURING EFFICIENCY, FLEXIBILITY, AUTONOMY, AND COST CONTAINMENT. USE NEXT-GENERATION DATABASE MANAGEMENT SYSTEMS. HANDS-ON PRACTICE ACTIVITIES FOR USING THE FEATURES OF DIFFERENT NOSQL DBMSS, QUERY EXECUTION AND DATA MANAGEMENT. STUDY OF METHODOLOGIES AND TOOLS FOR BUSINESS PROCESS MODELING.
Prerequisites
NONE
Contents
DETAILED PROGRAM: BY UNIT: (LESSON HOURS/PRACTICE HOURS/LABORATORY HOURS)

UNIT 1. INTRODUCTION TO DATA MANAGEMENT SYSTEMS (4/0/0)
FUNDAMENTAL CONCEPTS OF DBMSS
DATA TYPES AND SPECIFIC PROBLEMS OF LARGE AMOUNTS OF DATA IN DISTRIBUTED ENVIRONMENTS.
SCALABILITY AND AVAILABILITY AS KEY REQUIREMENTS FOR DATA MANAGEMENT SYSTEMS.
OLTP AND OLAP SYSTEMS; ACID PROPERTIES VS. BASIC
OVERVIEW NOSQL SYSTEMS.

UNIT 2. DISTRIBUTED ARCHITECTURES FOR DATABASES (4/0/0)
CONCEPTS AND SPECIFIC ISSUES OF DATABASE DESIGN IN DISTRIBUTED ENVIRONMENTS.

DISTRIBUTED ARCHITECTURES: FRAGMENTATION AND ALLOCATION; LEVELS OF TRANSPARENCY; CLASSIFICATION OF TRANSACTIONS.
DISTRIBUTED QUERY OPTIMIZATION.
TRANSACTIONS IN DISTRIBUTED ENVIRONMENTS. TRANSACTION ISOLATION AND CONCURRENCY CONTROL; CONCURRENCY IN DISTRIBUTED SYSTEMS.

DATA MANAGEMENT IN CLOUD COMPUTING AND BIG DATA ENVIRONMENTS (OUTLINE).
SCALABILITY AND AVAILABILITY OF DATA MANAGEMENT SYSTEMS
HORIZONTAL AND VERTICAL SCALABILITY OF DBMSS AND STRATEGIES FOR DEALING WITH LARGE DATA AMOUNTS.
TWO-PHASE COMMIT PROTOCOL. DATA REPLICATION.
DATA AVAILABILITY MANAGEMENT, HIGH AVAILABILITY AND FAULT TOLERANCE.
DATA REPLICATION TECHNIQUES, PARTITIONING AND LOAD BALANCING.

UNIT 3. NEW DBMS ARCHITECTURES: NOSQL AND NEWSQL. (6/0/0)
ADVANCED DATA MODELS: KEY-VALUE, COLUMN-ORIENTED, DOCUMENT-ORIENTED, GRAPH-ORIENTED AND LINKED DATA. BENEFITS, LIMITATIONS, AND USE CASES OF EACH DATA MODEL.
CAP THEOREM
NOSQL MODELS: DATA MANAGEMENT IN NOSQL SYSTEMS; SHARDING, REPLICATION, QUORUM. NEWSQL.
KEY-VALUE DATABASES: MODELS AND FRAMEWORKS FOR MANAGING BIG DATA
COLUMN DBMS, DATA DISTRIBUTION AND COMPRESSION;
MULTI-MODEL DATABASE

UNIT 4. DOCUMENT ORIENTED DATABASES (6/8/8)
NOSQL AND DOCUMENT-ORIENTED DATABASES;
APPLICATION TO A DATABASE SUCH AS MONGODB; ARCHITECTURE AND FEATURES; QUERY LANGUAGES AND APPLICATIONS.
LABORATORY EXERCISES AND INSIGHTS

UNIT 5. GRAPH ORIENTED DATABASES (6/8/8)
GRAPH ORIENTED DATABASE DATA STRUCTURES; DATA FORMAT AND ARCHIVES;
JSON-BASED DATA DESCRIPTION;
APPLICATION TO A NEO4J: ARCHITECTURE AND CHARACTERISTICS; CYPHER QUERY LANGUAGES. APPLICATIONS AND EXAMPLES



UNIT 6. BUSINESS PROCESS MANAGEMENT (BPM) (4/4/0)

INTRODUCTION TO THE BASIC CONCEPTS OF BPM AND THE PROCESS LIFECYCLE.
METHODOLOGIES AND TOOLS FOR BUSINESS PROCESS MODELING; PATTERNS OF ORCHESTRATION, CHOREOGRAPHY AND COLLABORATION
STANDARD BPMN (BUSINESS PROCESS MODEL AND NOTATION) AND/OR PEL4WS (BUSINESS PROCESS EXECUTION LANGUAGE FOR WEB SERVICES) BUSINESS PROCESS MODELING AND VALIDATION OF BP MODELS
PRACTICAL EXAMPLES AND EXERCISES

UNIT 7. DATA SECURITY AND PRIVACY (2/0/0)
SECURITY POLICIES FOR THE MANAGEMENT OF SENSITIVE DATA.
DATA PROTECTION, AUTHENTICATION, AUTHORIZATION AND ACCESS CONTROL. REGULATORY COMPLIANCE AND DATA PRIVACY (E.G. GDPR).
Teaching Methods
THE COURSE INCLUDES THEORETICAL LESSONS (32H) AND PRACTICAL EXERCISES ON THE COMPUTER (24H) AS WELL AS A LABORATORY PART (16H).
THE LECTURES FOCUS ON THEORETICAL CONCEPTS, METHODOLOGIES AND TECHNOLOGIES RELATED TO DATA MANAGEMENT, DATABASE ARCHITECTURES, DATA MODELS, SCALABILITY AND AVAILABILITY ISSUES.
NOSQL DATABASE ARE STUDIED AND USED IN THE EXERCISES AND IN THE LABORATORY PART.
STUDENTS WILL WORK ON COMPLEX PROJECTS WHILE ALSO TACKLING CHALLENGES RELATED TO DATA MANAGEMENT IN DISTRIBUTED ENVIRONMENTS.
Verification of learning
PROJECT AND ORAL EXAM. THE PROJECT CONSISTS IN APPLYING THE ACQUIRED KNOWLEDGE TO DESIGN AND IMPLEMENT SOLUTIONS FOR DATA MANAGEMENT PROBLEMS. THE ORAL EXAMINATION IS SUPPLEMENTARY AND ALLOWS TO EVALUATE THE STUDENT'S OVERALL PREPARATION.
Texts
PAOLO ATZENI, STEFANO CERI, STEFANO PARABOSCHI AND RICCARDO TORLONE, DATABASE SYSTEMS - CONCEPTS, LANGUAGES AND ARCHITECTURES, MCGRAW HILL.
REDIS DEEP DIVE: EXPLORE REDIS - ITS ARCHITECTURE, DATA STRUCTURES AND MODULES LIKE SEARCH, JSON, AI, GRAPH, TIMESERIES (ENGLISH EDITION) SUYOG DILIP KALE (AUTORE), CHINMAY KULKARNI (AUTORE), BPB
SULLIVAN, DAN, NOSQL FOR MERE MORTALS., ADDISON-WESLEY
MATERIAL PROVIDED BY THE LECTURER
More Information
The course is held in Italian
Lessons Timetable

  BETA VERSION Data source ESSE3 [Ultima Sincronizzazione: 2024-11-29]