INDEX
Explanations
official statements, declined comment
New Auto-Interp
Negative Logits
=?,
-1.13
sitive
-1.02
_;
-0.94
ڤ
-0.94
même
-0.91
đây
-0.90
addafi
-0.90
there
-0.88
/>
-0.88
минера
-0.87
POSITIVE LOGITS
spokes
1.09
spokespersons
0.91
declined
0.90
representatives
0.88
astral
0.84
需要
0.84
مؤرشف
0.83
and
0.83
demar
0.82
なし
0.79
Activations Density 0.008%