INDEX
Explanations
conjunctions and prepositions connected with law and political contexts
New Auto-Interp
Negative Logits
late
-0.87
raq
-0.84
ãĥī
-0.74
geist
-0.72
mun
-0.72
iers
-0.72
cow
-0.70
mers
-0.68
lish
-0.66
estate
-0.66
POSITIVE LOGITS
knowing
1.03
regard
1.02
mentioning
0.99
hesitation
0.97
interruption
0.95
sacrificing
0.94
exception
0.93
necessarily
0.93
specifying
0.90
recourse
0.89
Activations Density 0.872%