INDEX
Explanations
mentions of specific locations or organizations
references to Chicago
New Auto-Interp
Negative Logits
regress
-0.64
dividend
-0.63
schild
-0.61
MENT
-0.60
WARD
-0.58
revelation
-0.58
tert
-0.57
respects
-0.57
Scythe
-0.56
neut
-0.56
POSITIVE LOGITS
oga
0.92
oland
0.82
uan
0.77
oen
0.76
ulhu
0.76
otta
0.74
atti
0.73
ega
0.72
ractor
0.72
amac
0.71
Activations Density 0.100%