INDEX
Explanations
references to freedom and individual rights
New Auto-Interp
Negative Logits
Оно
-0.58
szönöm
-0.55
мәкал
-0.53
自分は
-0.53
They
-0.53
CompilerServices
-0.52
MonoBehaviour
-0.52
postIndex
-0.52
They
-0.51
私は
-0.51
POSITIVE LOGITS
our
4.37
our
2.96
nosso
2.72
Our
2.70
我们的
2.69
我們的
2.60
nossa
2.52
unserer
2.50
nossos
2.48
Our
2.48
Activations Density 1.774%