Explanations
mentions of geographical locations, particularly cities and universities, especially in the context of news articles
Negative Logits
mble      -1.11
lly       -0.85
umbn      -0.78
xual      -0.77
UCT       -0.76
oglobin   -0.72
ÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤÃĥÃĤ   -0.68
sworth    -0.67
odynam    -0.66
nown      -0.66
Positive Logits
Diego      0.90
Oro        0.86
Chargers   0.84
San        0.84
Rican      0.80
Rica       0.79
Unified    0.78
Francisco  0.77
Mate       0.77
Tome       0.76
Activations Density 4.824%