Computer workshops 4018-CW5
1. Text file formats and character coding systems
Coding systems: ASCII (American Standard Code for Information Interchange), ISO 8859-n (a family of standards from the International Organization for Standardization), UTF-8. Converting files between formats. Multikey – a free utility for writing in many different alphabets in Unicode.
Exercises: installing the Greek alphabet (defining a keyboard shortcut for the language); virtual keyboard (as a start); vowel accent marks (ά, ύ, ί, ό).
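The difference between the coding systems named above can be seen by encoding the accented Greek vowels from the exercise. The following is a minimal Python sketch, not part of the course materials; the choice of ISO 8859-7 (the Greek member of the ISO 8859-n family) and all variable names are assumptions made for illustration.

```python
# Accented Greek vowels from the exercise, written as Unicode escapes.
vowels = ["\u03ac", "\u03cd", "\u03af", "\u03cc"]  # ά, ύ, ί, ό

for ch in vowels:
    utf8 = ch.encode("utf-8")        # two bytes per letter in UTF-8
    greek = ch.encode("iso8859_7")   # one byte per letter in ISO 8859-7
    print(ch, f"U+{ord(ch):04X}", utf8.hex(), greek.hex())

# ASCII contains no Greek letters, so the same text cannot be encoded in it.
try:
    vowels[0].encode("ascii")
    ascii_ok = True
except UnicodeEncodeError:
    ascii_ok = False

# Converting between formats: decode with the source encoding, re-encode with the target.
legacy_bytes = vowels[0].encode("iso8859_7")
converted = legacy_bytes.decode("iso8859_7").encode("utf-8")
```

The same decode-then-encode step applies to whole files: read the bytes with the encoding the file was saved in, then write the text back out in the target encoding.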
2. Text corpora. Regular expressions.
Definition of a corpus; types of corpora; purposes of corpora.
General characteristics of the Corpus of the Polish Language at the Institute of Computer Science of the Polish Academy of Sciences (KIPI at IPI PAN).
Searching in KIPI. Basics of regular expressions. Searching with regular expressions.
Searches in KIPI involving grammatical data. The attributes [orth], [base], [pos], and [tag].
Exercises: advanced queries to the corpus.
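As an illustration of what a simple regular expression does, the sketch below searches a short Polish sentence for word forms that share a stem. It uses Python's `re` module on an invented sample sentence; it does not reproduce the query syntax of the KIPI search engine or its [orth], [base], [pos], and [tag] attributes.

```python
import re

# Invented sample text; a real query would run against the corpus itself.
text = "Studenci czytali teksty, a potem czytają i będą czytać kolejne."

# \b marks a word boundary; \w* matches any run of letters after the stem "czyta".
pattern = re.compile(r"\bczyta\w*")

matches = pattern.findall(text)
print(matches)
```

The pattern finds the surface forms "czytali", "czytają", and "czytać", which is roughly what a search by base form achieves in a grammatically annotated corpus without listing endings by hand.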
Analysis of HTML documents. Tags: HTML, HEAD, META, BODY, DIV, P, SPAN, B, I, U, TABLE, TR, TD, IMG, A.
Exercises/homework: Creating a simple Web page using HTML.
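Analysing an HTML document largely means recognising which tags it contains and how they nest. The sketch below, offered as one possible illustration, feeds a small invented page to Python's standard `html.parser` module and records every opening tag in document order. The page content and the `TagCollector` class are hypothetical.

```python
from html.parser import HTMLParser

# A small invented page using several of the tags listed above.
page = """<html>
<head><meta charset="utf-8"></head>
<body>
<div><p>Text with <b>bold</b> and <i>italic</i> words and a <a href="page.html">link</a>.</p></div>
<table><tr><td>cell</td></tr></table>
</body>
</html>"""


class TagCollector(HTMLParser):
    """Records the name of every opening tag in document order."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        # The parser reports tag names in lower case.
        self.tags.append(tag)


collector = TagCollector()
collector.feed(page)
print(collector.tags)
```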
Defining a styling file. Basic CSS attributes.
Exercises/homework: Formatting HTML documents using CSS.
5. Introduction to XML
The concept of metalanguage. The structure of XML documents. Definition of XML documents using DTD or XML Schema. Editing XML documents.
Exercises/homework: designing a document schema.
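The structure of an XML document (one root element, nested child elements, attributes) can be inspected programmatically. The sketch below uses Python's standard `xml.etree.ElementTree` module on an invented `letter` document; the element and attribute names are assumptions. Note that this module checks only that a document is well-formed; it does not validate against a DTD or XML Schema.

```python
import xml.etree.ElementTree as ET

# Invented example document: a letter with three child elements.
doc = """<letter>
  <sender>A</sender>
  <recipient>B</recipient>
  <body lang="la">Salutem.</body>
</letter>"""

root = ET.fromstring(doc)
print(root.tag)
for child in root:
    # Each element has a tag name, a dictionary of attributes, and text content.
    print(child.tag, child.attrib, child.text)

# A document with a mismatched closing tag is not well-formed and is rejected.
try:
    ET.fromstring("<letter><sender>A</letter>")
    well_formed = True
except ET.ParseError:
    well_formed = False
```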
6. Introduction to XSLT
Converting XML documents to HTML using XSLT.
Exercises/homework: transforming XML files into Web pages.
7. Text Encoding Initiative (TEI). Examples of XML application in the humanities.
TEI – tagging standard for the humanities. TEI Guidelines – description of the main tags.
Examples of XML application. Corpus of the Polish Language (IPI PAN). Corpus of the texts and correspondence of Dantiscus (IBI UW).
8. Tagging texts according to a schema.
9. Introduction to relational databases. Overview of database systems: MS Access, MySQL, Oracle.
The concepts of relation, Cartesian product, table, and record. Representation of a database model using ERD (entity-relationship diagram).
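The link between a Cartesian product and a relation can be made concrete with two small sets. In the Python sketch below, with invented student and course names, the Cartesian product lists every possible pair, and a relation is simply the subset of pairs that actually hold. Each pair in the relation corresponds to one record in a table.

```python
from itertools import product

students = ["Anna", "Piotr"]
courses = ["Greek", "Latin"]

# Cartesian product: every student paired with every course (2 x 2 = 4 pairs).
cartesian = list(product(students, courses))
print(cartesian)

# A relation is a subset of the Cartesian product: here, who attends which course.
enrolled = {("Anna", "Greek"), ("Piotr", "Greek"), ("Piotr", "Latin")}
print(enrolled <= set(cartesian))
```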
10. Introduction to SQL.
Creating tables using database administration interfaces. Simple select queries in SQL (SELECT).
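A simple SELECT query can be tried without installing a database server. The sketch below uses SQLite through Python's standard `sqlite3` module as a stand-in for the systems named in the course (MS Access, MySQL, Oracle); the `word` table and its contents are invented for illustration.

```python
import sqlite3

# An in-memory SQLite database; nothing is written to disk.
conn = sqlite3.connect(":memory:")

# Create a table with three columns.
conn.execute("CREATE TABLE word (id INTEGER PRIMARY KEY, form TEXT, language TEXT)")

# Insert a few records (invented sample data).
conn.executemany(
    "INSERT INTO word (form, language) VALUES (?, ?)",
    [("logos", "Greek"), ("verbum", "Latin"), ("mythos", "Greek")],
)

# A simple SELECT: choose one column, filter rows, sort the result.
rows = conn.execute(
    "SELECT form FROM word WHERE language = 'Greek' ORDER BY form"
).fetchall()
greek_forms = [row[0] for row in rows]
print(greek_forms)

conn.close()
```

The SELECT statement itself is standard SQL, so the same query should work in the other systems once the table exists there.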
11. Designing databases.
Exercises: designing a simple database.
12. Generating database applications.
Oracle APEX. MS Access forms.
Exercises/homework: creating Web applications in Oracle APEX.
13. Generating database applications – continued
The concept of a formula. Mathematical and textual functions. Sorting.
Exercises: creating a spreadsheet. Processing data using formulas. Creating a graph from spreadsheet data.
- awareness of the diversity of digital file formats and character coding systems; performing conversions between formats and systems; setting the keyboard of one’s choice in Windows
- constructing a simple regular expression
- using the search engine of the text corpora covered by the course
- analysing and modifying an existing Web page and CSS styling sheet; creating a simple Web page
- identifying basic XML concepts; being aware of the textual data processing capabilities of XML and XSLT
- being aware of the existence of tagging standards such as TEI; experience with tagging texts according to TEI guidelines
- understanding these notions: database, relation, table, record
- understanding an ERD diagram and using it to create tables in a database system
- knowing what SQL is used for; understanding the notion of a query; building a simple SELECT query
- creating a form for editing table data in MS Access or Oracle APEX
- creating a spreadsheet, entering data and performing calculations using formulas
Completing exercises and homework 60%
In selected cases – final oral test
Wimmer, Paweł, Akademickie narzędzia Microsoft Word 2007, 2012.
Wolański, Adam, Edycja tekstów. Praktyczny poradnik. Warszawa 2008, Wydawnictwo Naukowe PWN
Help pages from the MS Office website.
Information on the level of this course, the year of study and semester in which the course unit is delivered, and the types and number of class hours can be found in the course structure diagrams of the appropriate study programmes. This course is related to the following study programmes:
- Modern Greek philology, full-time, first-cycle programme
- Cultural studies – Mediterranean civilization, full-time, first-cycle programme
Additional information (registration calendar, class instructors, location and schedule of classes) may be available in the USOSweb system: