(in Polish) Dane, strumienie, rozpraszanie 1000-2D23DSR
The seminar is devoted to the theory and practice of data management and knowledge representation. We are interested in challenges related to the processing of data, queries, and metadata (schemas, constraints, dependencies, ontologies), ranging from designing and analyzing abstract formalisms all the way to database systems architecture and distributed processing of big data. We like our data in all flavors: not only relational, but also semistructured (XML, JSON), graph (RDF, LPG), object, text, temporal, stream, GIS, and others.
The problems tackled can be theoretical, requiring tools from algorithmics, combinatorics, logic (e.g. finite model theory), and automata theory, as well as very practical, in the spirit of systems and software engineering. MSc theses written within our seminar may study decidability and complexity of abstract problems, design algorithms and heuristics, implement and experiment with existing theoretical solutions, or analyze, compare and extend existing systems.
We meet and discuss with experts in other disciplines, who sometimes supply ideas for MSc theses. We have cooperated or are currently cooperating with astronomers, chemists, and geographers. We are also open for other areas where databases can be applied.
Seminar presentations are usually based on recent papers presented at leading international conferences devoted to data management and knowledge representation, such as VLDB, PODS, SIGMOD, or KR.
Selected topics:
* Data models, semantics, query languages
* Data provenance
* Databases for emerging hardware
* Distributed and parallel databases
* Graph data management, RDF, social networks, Semantic Web
* Knowledge discovery, clustering, data mining
* Machine learning for data management and vice versa
* Model theory, logics, algebras, computational complexity
* Ontology-based data access, data integration and exchange, metadata management
* Ontology formalisms and models, description logics
* Privacy, security, ethics
* Query processing and optimization
* Scientific databases
* Semi-structured data
* Small data, end-user programming
* Storage, indexing, and physical database design
* Streams, sensor networks, complex event processing
* Transaction processing
* Uncertainty, incompleteness, and inconsistency in data management
Type of course
Course coordinators
Learning outcomes
The students prepare and deliver seminar talks (K_U11) prepared on the basis of the newest English-language publications concerning widely understood database topics (K_U14, K_K08), in particular at conferences and in journals endorsed by ACM and IEEE (K_K07).
The student who presents a talk is expected not only to report on the paper, but also to express his/her own opinion on it (K_K06). Members of the audience are expected to participate in the discussion following the presentation (K_K02).
We have a second kind of presentations, those related to the preparation of the Master's Thesis (K_U13). The first such talk is typically given soon after determining the Thesis' topic, and the student is expected to present a plan of how he/she intends to gain the knowledge necessary to prepare the Thesis (K_K01, K_U15,K_K03).
Assessment criteria
First-year students receive credits for the seminar on the basis of two presentations and the registration of the subject of the MS thesis.
Second-year students receive credits for the seminar on the basis of two presentations and submitting their MS thesis.
Bibliography
Conference proceedings of VLDB, PODS, SIGMOD, KR.
Additional information
Additional information (registration calendar, class conductors, localization and schedules of classes), might be available in the USOSweb system: