INDEX
Explanations
words related to war and conflict
words related to the concept of "arrest."
New Auto-Interp
Negative Logits
ĸļ
-0.86
¬¼
-0.82
e
-0.69
assetsadobe
-0.67
ĨĴ
-0.66
eries
-0.65
er
-0.64
insula
-0.63
éĹĺ
-0.63
ĪĴ
-0.62
POSITIVE LOGITS
riors
1.06
beit
1.04
thur
1.04
ctica
1.03
riage
1.01
acter
1.00
acters
0.99
ithmetic
0.98
ashtra
0.94
allel
0.92
Activations Density 0.053%