INDEX
Explanations
proper nouns related to political figures, events, and historical contexts
New Auto-Interp
Negative Logits
ailable
-0.70
PLIED
-0.63
ous
-0.59
ulhu
-0.58
ously
-0.58
generic
-0.57
ises
-0.57
ruck
-0.56
raints
-0.56
insula
-0.55
POSITIVE LOGITS
Lyndon
0.66
Johnson
0.62
ILCS
0.62
Johnson
0.61
Bib
0.60
tract
0.59
vard
0.59
tracts
0.57
Shoals
0.57
eston
0.57
Activations Density 8.330%