INDEX
Explanations
geographical locations and political or military terms
New Auto-Interp
Negative Logits
owe
-0.76
urai
-0.66
Doctrine
-0.63
talk
-0.61
orr
-0.61
ipeg
-0.56
antage
-0.56
acks
-0.56
oos
-0.55
vantage
-0.54
POSITIVE LOGITS
by
1.23
BY
1.06
by
1.03
By
0.90
bys
0.90
ĸļ
0.89
aback
0.84
By
0.84
chiefly
0.81
solely
0.79
Activations Density 1.080%