Basic Knowledge of Python
This course introduces linguists and programmers to NLP in Python. We will mainly use NLTK (Natural Language Toolkit, nltk.org), along with other libraries that are useful for NLP. The course can currently be taught in either Python 2.x or Python 3.x. Examples are in English or Mandarin (普通话). Other languages can also be used if agreed before booking.
Overview of Python packages related to NLP
Introduction to NLP (examples in Python of course)
- Simple Text Manipulation
- Searching Text
- Counting Words
- Splitting Texts into Words
- Lexical dispersion
- Processing complex structures
- Representing text in Lists
- Indexing Lists
- Collocations
- Bigrams
- Frequency Distributions
- Conditionals with Words
- Comparing Words (startswith, endswith, islower, isalpha, etc.)
- Natural Language Understanding
- Word Sense Disambiguation
- Pronoun Resolution
- Machine Translation (statistical, rule-based, literal, etc.)
- Exercises
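
As a taste of the text-manipulation topics above (splitting text into words, counting words, bigrams, frequency distributions, conditionals with words), here is a minimal sketch in plain Python. It uses only the standard library rather than NLTK, and the sample sentence and variable names are made up for illustration; the course itself may approach these topics differently.

```python
import re
from collections import Counter

# Hypothetical sample text, used only for illustration.
text = "The cat sat on the mat. The dog sat on the log."

# Splitting text into words: lowercase, then keep alphabetic runs only.
words = re.findall(r"[a-z]+", text.lower())

# Counting words: a simple frequency distribution.
freq = Counter(words)

# Bigrams: pair each word with the word that follows it.
bigrams = list(zip(words, words[1:]))
bigram_freq = Counter(bigrams)

# Conditionals with words: select words using a string test.
s_words = sorted({w for w in words if w.startswith("s")})

print(len(words))            # number of word tokens
print(freq.most_common(2))   # the two most frequent words
print(bigram_freq[("sat", "on")])
print(s_words)
```

A Counter is used here as a stand-in for a frequency distribution because it needs no extra installation or corpus downloads.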
NLP in Python in examples
- Accessing Text Corpora and Lexical Resources
- Common sources for corpora
- Conditional Frequency Distributions
- Counting Words by Genre
- Creating own corpus
- Pronouncing Dictionary
- Shoebox and Toolbox Lexicons
- Senses and Synonyms
- Hierarchies
- Lexical Relations: Meronyms, Holonyms
- Semantic Similarity
- Processing Raw Text
- Printing
- Truncating
- Extracting parts of string
- Accessing individual characters
- Searching, replacing, splitting, joining, indexing, etc.
- Using regular expressions
- Detecting word patterns
- Stemming
- Tokenization
- Normalization of text
- Word Segmentation (especially in Chinese)
- Categorizing and Tagging Words
- Tagged Corpora
- Tagged Tokens
- Part-of-Speech Tagset
- Python Dictionaries
- Words to Properties mapping
- Automatic Tagging
- Determining the Category of a Word (Morphological, Syntactic, Semantic)
- Text Classification (Machine Learning)
- Supervised Classification
- Sentence Segmentation
- Cross Validation
- Decision Trees
- Extracting Information from Text
- Chunking
- Chinking
- Tags vs Trees
- Analyzing Sentence Structure
- Context Free Grammar
- Parsers
- Building Feature Based Grammars
- Grammatical Features
- Processing Feature Structures
- Analyzing the Meaning of Sentences
- Semantics and Logic
- Propositional Logic
- First-Order Logic
- Discourse Semantics
- Managing Linguistic Data
- Data Formats (Lexicon vs Text)
- Metadata
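
To give a feel for the raw-text processing topics listed above (normalization, tokenization, detecting word patterns with regular expressions, stemming), here is a small sketch using only the Python standard library. The sample string and the `naive_stem` helper are hypothetical and deliberately simplistic; the helper is a toy suffix stripper, not the Porter stemmer or any NLTK stemmer.

```python
import re

# Hypothetical raw input, used only for illustration.
raw = "  Tokenization, stemming and NORMALIZATION are covered; so is tagging.  "

# Normalization: trim surrounding whitespace and lowercase the text.
normalized = raw.strip().lower()

# Tokenization: words and punctuation marks become separate tokens.
tokens = re.findall(r"\w+|[^\w\s]", normalized)

# Detecting word patterns: tokens that end in "ing".
ing_words = [t for t in tokens if re.search(r"ing$", t)]


def naive_stem(word):
    """Strip one common suffix. A toy rule, not a real stemming algorithm."""
    for suffix in ("ization", "ing", "ed", "s"):
        # Keep at least three characters of the word after stripping.
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return word


# Stem only the alphabetic tokens, skipping punctuation.
stems = [naive_stem(t) for t in tokens if t.isalpha()]

print(tokens)
print(ing_words)
print(stems)
```

The toy stemmer over-strips and under-strips in many cases (for example, it turns "stemming" into "stemm"), which is one reason dedicated stemmers and lemmatizers exist.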