Back to selection
Semester | fall semester 2024 |
Course frequency | Irregular |
Lecturers |
Benjamin Arold (benjamin.arold@unibas.ch, Assessor)
Claudia Marangon (claudia.marangon@unibas.ch) |
Content | «Much of human knowledge is stored in unstructured formats, in particular in written text. This course teaches methods to process and analyze text data. The learning goals are to understand and implement text-as-data methods, and to evaluate the use of text-analysis tools in business and economics research. The course will conclude with an overview of non-standard data in business and economics beyond text, in particular audio data and image data. The course covers 10 topics, see below. For most topics, a theoretical lecture will be provided first, followed by a discussion of a recent research paper in economics/NLP, and concluded by a section of applied data analysis (in python), where the students are invited to code as well. The paper discussion will be conducted as a collaborative seminar, where students will take turns to present and discuss the papers. The 10 topics are ordered as follows: - Overview - Dictionaries - Tokenization & Distance - Unsupervised and Supervised ML with Text - Word Embeddings - Linguistic Parsing - Embedding Sequences with Attention - Extra: Using Transformers for YOUR Research - Audio Data in Business & Economics - Image Data in Business & Economics |
Learning objectives | The learning goals are to understand and implement text-as-data methods, and to evaluate the use of text-analysis tools in economics research. |
Bibliography | Books • Jurafsky and Martin, Speech and Language Processing (3d Ed. 2019). o Available here: https://web.stanford.edu/~jurafsky/slp3/ o The standard theory text on computational linguistics. • Natural Language Processing in Python, Third Edition (“NLTK Book”). o Available at nltk.org/book. o Classic treatments of traditional NLP tools. • Aurelien Geron, Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow (2019) o O’Reilly Book, should be available with an academic account using ETH email. o A great practical book for machine learning and deep learning in Python, but not NLP-focused. We will use material from Chapters 2-4, 7-11, 13, and 15-17. o The deep learning chapters use Keras + TensorFlow. o Jupyter notebooks: https://github.com/ageron/handson-ml2 • Yoav Goldberg, Neural Network Methods for Natural Language Processing (2017) o ETH Library Online Access (email me if this doesn’t work) o A more advanced theoretical treatment of neural networks with an NLP focus, but already somewhat dated. We will use material from Chapters 1-17 and 19. More readings and material will be announced in the course or on request. |
Course application | Registration: Please enroll in the Online Services (services.unibas.ch); PhD - students of other Swiss Universities first have to register at the University of Basel BEFORE the start of the course and receive their login data by post (e-mail address of the University of Basel). Processing time up to a week! Detailed information can be found here: https://www.unibas.ch/de/Studium/Mobilitaet.html After successful registration you can enroll for the course in the Online Services (services.unibas.ch). Applies to everyone: Enrolment = Registration for the course and the exam! If you have any questions, please do not hesitate to contact the Graduate School administration at gsbe-wwz@unibas.ch. |
Language of instruction | English |
Use of digital media | No specific media used |
Interval | Weekday | Time | Room |
---|---|---|---|
täglich | See individual dates |
Comments |
The lecturers are: Dr. Benjamin Arold (ETH Zürich), Dr. Claudia Marangon (ETH Zürich) |
Date | Time | Room |
---|---|---|
Monday 16.12.2024 | 12.15-16.00 | Wirtschaftswissenschaftliche Fakultät, Grosses PC-Labor S18 HG.37 |
Tuesday 17.12.2024 | 14.15-18.00 | Wirtschaftswissenschaftliche Fakultät, Grosses PC-Labor S18 HG.37 |
Wednesday 18.12.2024 | 14.15-18.00 | Wirtschaftswissenschaftliche Fakultät, Grosses PC-Labor S18 HG.37 |
Thursday 19.12.2024 | 12.15-16.00 | Wirtschaftswissenschaftliche Fakultät, Grosses PC-Labor S18 HG.37 |
Friday 20.12.2024 | 10.15-14.00 | Wirtschaftswissenschaftliche Fakultät, Grosses PC-Labor S18 HG.37 |
Modules |
Modul: Fachlich-methodische Ausbildung (PhD subject: Staatswissenschaften) Modul: Fachlich-methodische Weiterbildung (Doctoral Studies - Faculty of Business and Economics (start of studies before 01.02.2024)) |
Assessment format | record of achievement |
Assessment details | Take-home assignment. |
Assessment registration/deregistration | Reg.: course registration, dereg: cancel course registration |
Repeat examination | no repeat examination |
Scale | 1-6 0,1 |
Repeated registration | as often as necessary |
Responsible faculty | Faculty of Business and Economics , studiendekanat-wwz@unibas.ch |
Offered by | Faculty of Business and Economics |