INDEX
Explanations
mentions of locations, including states, cities, and universities
expressions of inability or restrictions
New Auto-Interp
Negative Logits
suspic
-0.61
agents
-0.61
©¶æ
-0.60
agent
-0.59
Gujar
-0.55
Pakistani
-0.54
mans
-0.53
hid
-0.51
ISI
-0.51
utsu
-0.51
POSITIVE LOGITS
rebuilt
0.71
rebuilding
0.63
oother
0.63
rebuild
0.60
Congratulations
0.59
goodbye
0.58
Horizons
0.58
bye
0.57
postponed
0.55
Tomorrow
0.55
Activations Density 2.418%