Introduction to Natural Language Processing


Figure 1

The Interpretive Loop

Figure 2

Search and Document Summarization

Figure 3

Topic Modeling Graph

Figure 4

Named Entity Recognition

Corpus Development- Text Data Collection


Preparing and Preprocessing Your Data


Figure 1

The Interpretive Loop

Vector Space and Distance


Figure 1

The Interpretive Loop

Figure 2

png

Figure 3

png

Document Embeddings and TF-IDF


Latent Semantic Analysis


Figure 1

The Interpretive Loop

Figure 2

Maps with different projections of the Earth

Figure 3

Image of drop-off of variance explained

Figure 4

Plot results of our LSA model

Figure 5

Plot results of our LSA model, color-coded by author

Figure 6

Plot results of our LSA model, revised with new axis labels

Intro to Word Embeddings


Figure 1

Skipgram

Figure 2

PCA Variance Explained

Figure 3

Visualizing Word Embeddings with PCA

The Word2Vec Algorithm


Figure 1

Single artificial neuron

Figure 2

Linear Decision Boundary

Figure 3

Multilayer neural network

Figure 4

Hierarchical Feature Representations - Face Detection

Figure 5

Skipgram

Figure 6

Word2Vec Model Architecture (Skip-gram)

Figure 7

Image from Word2Vec research paper, by Mikolov et al

Training Word2Vec


Finetuning LLMs


Figure 1

BERT fine-tune

Figure 2

BERT fine-tune

Ethics and Text Analysis