INDEX
Explanations
words related to political or social reform
the concept of reform and its various contexts and implications
New Auto-Interp
Negative Logits
Coulter
-0.68
gaard
-0.66
McA
-0.63
Bryant
-0.59
ammy
-0.59
tongues
-0.58
Beir
-0.58
PERSON
-0.57
iffin
-0.57
LIMITED
-0.57
POSITIVE LOGITS
atted
1.10
ulated
1.07
ulation
1.06
atories
0.98
ers
0.96
rats
0.95
ulatory
0.94
ible
0.92
er
0.91
ulations
0.89
Activations Density 0.046%