Explanations
phrases related to historical events, specifically the American Civil War and civil rights movements
Negative Logits
gger       -0.67
scratch    -0.67
ugi        -0.64
dipping    -0.63
spoiler    -0.62
branded    -0.61
sque       -0.61
GY         -0.61
flat       -0.61
cki        -0.61
Positive Logits
Liberties   1.46
Rights      1.22
izations    1.13
ian         1.05
izational   1.02
isations    0.96
ized        0.93
War         0.92
ization     0.92
Service     0.91
Activations Density 0.015%