INDEX
Explanations
words related to authority, power dynamics, and legal proceedings
important actions or consequences related to events, particularly in political or social contexts
New Auto-Interp
Negative Logits
Revision
-0.64
postwar
-0.63
Kush
-0.62
1948
-0.61
1906
-0.59
notwithstanding
-0.59
Crunch
-0.59
Alley
-0.58
Huff
-0.58
Ballard
-0.58
POSITIVE LOGITS
tnc
0.84
DCS
0.81
]);
0.78
>]
0.76
aeus
0.75
Reviewer
0.75
taboola
0.75
erd
0.72
');
0.72
>>>>
0.69
Activations Density 0.569%