Students will be expected to be able to gain enough familiarity to install, run and perform project work on these packages on their own machines.
Projects to be tackled in this course are themed around the topic of language understanding:
Students are expected to install required software (all available freely) on their own machines.
Initial meeting. Syllabus. Parsing methods contd.: LR-parsing.
Lecture Notes: lecture1.pdf / lecture1.pptx (34 slides)
Topics: LR(0) and LR(1) grammars contd. Homework 1. Corpora and N-gram language models. Colorless green ideas and parsing models.
Lecture Notes: lecture2.pdf / lecture2.pptx (51 slides)
Slides updated: 1:18pm Jan 20th
Files: grammar0.pl / lr0.pl / parse.pl / lr1.pl / parse1.pl
Homework 1 Review. Homework 2: install tregex with the Penn Treebank.
Lecture Notes: lecture3.pdf / lecture3.pptx (27 slides)
Slides updated: 12:15pm 1/27.
Colorless Green Ideas revisited. Tregex. Homework 3: small clauses and tregex.
Lecture Notes: lecture4.pdf / lecture4.pptx (30 slides)
Homework 3 review. Statistical parsing and treebanks. Homework 4: install Bikel-Collins.
Lecture Notes: lecture5.pdf / lecture5.pptx (59 slides)
Dan Bikel's reimplementation of Collins' Parser. dbp.zip
MXPOST tagger. BikelCollins parser/trainer. EVALB.
Lecture Notes: lecture6.pdf / lecture6.pptx (39 slides)
Section 23 explained. Homework: training data run.
Lecture Notes: lecture7.pdf / lecture7.pptx (34 slides)
Section 23 sentences: tregex_wsj_23.txt
Recap: Bikel-Collins, tregex and EVALB.
WordNet. Perl package Wordnet Query Data.
Lecture Notes: lecture8.pdf / lecture8.pptx (66 slides)
Slides updated: 12pm 3/9
Spring break. No class.
WordNet on OS X installation issue. Searching WordNet programmatically. Wordnet Homework exercise.
Lecture Notes: lecture9.pdf / lecture9.pptx (35 slides)
WordNet Homework review. 2nd WordNet homework: GRE word/definition matching. Other WordNet topics.
Lecture Notes: lecture10.pdf / lecture10.pptx (56 slides)
Slides updated: 12pm 3/30/16
WordNet Homework hints: WordNet::Similarity online and Perl Module versions. Framenet: lexical units (LU) and Frames. On the Generative Capacity of natural languages.
Lecture Notes: lecture11.pdf / lecture11.pptx (42 slides)
Solutions to the GRE WordNet homework. Factoid Question-Answering. TREC-9 Database. Homework: use parsing, syntactic transformation, internet search and WordNet.
Lecture Notes: lecture12.pdf / lecture12.pptx (33 slides)
(For easier use: code uses m@rk instead if mark and sub found takes two arguments now)
Word vector models and the GRE homework. GloVe.
Lecture Notes: lecture13.pdf / lecture13.pptx (48 slides)
Updated: 12pm 4/20
cosines.py (Compute vector cosines)
matlab.zip (Matlab supporting functions)
Lecture Notes: lecture14.pdf / lecture14.pptx (45 slides)
Quick overview of logic. Propositional logic. Phrase structure grammar
in Prolog. Solving the left recursion problem by lookahead.
Slides updated: 12:30pm 4/27
Grammar developed in class: g.pl
Propositional logic evaluator:
plogic.pl / plogic2.pl / plogic3.pl
Lecture Notes: lecture15.pdf / lecture15.pptx (49 slides)
Lambda calculus. Semantic grammars in Prolog. Montague-style
quantifiers vs. Generalized quantifiers. Upwards/downwards entailment
Grammar developed previously in class: g.pl
Combined Syntax/Semantics grammar developed in class: g2.pl (Updated 12:30pm 5/4/2016)