Back
Semester | fall semester 2023 |
Course frequency | Irregular |
Lecturers |
Benjamin Arold (benjamin.arold@unibas.ch, Assessor)
Claudia Marangon (claudia.marangon@unibas.ch) Alessandra Stampi-Bombelli (alessandra.stampi-bombelli@unibas.ch) |
Content | Much of human knowledge is stored in unstructured formats, in particular in written text. This course teaches methods to process and analyze text data. The learning goals are to understand and implement text-as-data methods, and to evaluate the use of text-analysis tools in economics research. The course covers 10 topics, see below. For most topics, a theoretical lecture will be provided first, followed by a section of applied data analysis (in python), where the students are invited to code as well, and concluded by a discussion of a recent research paper in economics/NLP. The paper discussion will be conducted as a collaborative seminar, where students will take turns to present and discuss the papers. The 10 topics are ordered as follows: 1 Overview 2 Dictionaries 3 Tokenization 4 Distance 5 Topic Models 6 ML with text 7 Word Embeddings 8 Linguistic Parsing 9 Embedding Sequences with Attention 10 Extras (Recent NLP) |
Learning objectives | The learning goals are to understand and implement text-as-data methods, and to evaluate the use of text-analysis tools in economics research. |
Bibliography | Books • Natural Language Processing in Python, Third Edition (“NLTK Book”). o Available at nltk.org/book. o Classic treatments of traditional NLP tools. • Aurelien Geron, Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow (2019) o O’Reilly Book, should be available with an academic account using ETH email. o A great practical book for machine learning and deep learning in Python, but not NLP-focused. We will use material from Chapters 2-4, 7-11, 13, and 15-17. o The deep learning chapters use Keras + TensorFlow. o Jupyter notebooks: https://github.com/ageron/handson-ml2 • Yoav Goldberg, Neural Network Methods for Natural Language Processing (2017) o ETH Library Online Access (email me if this doesn’t work) o A more advanced theoretical treatment of neural networks with an NLP focus, but already somewhat dated. We will use material from Chapters 1-17 and 19. • Jurafsky and Martin, Speech and Language Processing (3d Ed. 2019). o Available here: https://web.stanford.edu/~jurafsky/slp3/ o The standard theory text on computational linguistics. More readings and material will be announced in the course or on request. |
Comments | The lecturers are: Dr. Benjamin Arold (ETH Zürich), Dr. Alessandra Stampi – Bombelli (ETH Zürich), Dr. Claudia Marangon (ETH Zürich) |
Course application | Registration: Please enroll in the Online Services (services.unibas.ch); PhD - students of other Swiss Universities first have to register at the University of Basel BEFORE the start of the course and receive their login data by post (e-mail address of the University of Basel). Processing time up to a week! Detailed information can be found here: https://www.unibas.ch/de/Studium/Mobilitaet.html After successful registration you can enroll for the course in the Online Services (services.unibas.ch). Applies to everyone: Enrolment = Registration for the course and the exam! If you have any questions, please do not hesitate to contact the Graduate School administration at gsbe-wwz@unibas.ch. |
Language of instruction | English |
Use of digital media | No specific media used |
Interval | Weekday | Time | Room |
---|---|---|---|
wöchentlich | Thursday | 12.00-15.00 | Wirtschaftswissenschaftliche Fakultät, Grosses PC-Labor S18 HG.37 |
Comments | The lecturers are: Dr. Benjamin Arold (ETH Zürich), Dr. Alessandra Stampi – Bombelli (ETH Zürich), Dr. Claudia Marangon (ETH Zürich) |
Date | Time | Room |
---|---|---|
Thursday 05.10.2023 | 12.00-15.00 | Wirtschaftswissenschaftliche Fakultät, Grosses PC-Labor S18 HG.37 |
Thursday 12.10.2023 | 12.00-16.00 | Wirtschaftswissenschaftliche Fakultät, Grosses PC-Labor S18 HG.37 |
Thursday 19.10.2023 | 12.00-16.00 | Wirtschaftswissenschaftliche Fakultät, Grosses PC-Labor S18 HG.37 |
Thursday 26.10.2023 | 12.00-16.00 | Wirtschaftswissenschaftliche Fakultät, Grosses PC-Labor S18 HG.37 |
Thursday 02.11.2023 | 12.00-15.00 | Wirtschaftswissenschaftliche Fakultät, Grosses PC-Labor S18 HG.37 |
Modules |
Modul: Fachlich-methodische Weiterbildung (Doctoral Studies - Faculty of Business and Economics) |
Assessment format | record of achievement |
Assessment details | Take-home assignment. |
Assessment registration/deregistration | Reg.: course registration, dereg: cancel course registration |
Repeat examination | no repeat examination |
Scale | 1-6 0,1 |
Repeated registration | as often as necessary |
Responsible faculty | Faculty of Business and Economics , studiendekanat-wwz@unibas.ch |
Offered by | Faculty of Business and Economics |