INDEX
Negative Logits
rungsseite
-0.64
ViewFeatures
-0.63
Noted
-0.63
styleType
-0.62
jspb
-0.60
astify
-0.57
SourceChecksum
-0.57
Personendaten
-0.57
PreferredItem
-0.57
newswire
-0.55
POSITIVE LOGITS
HAI
0.58
îtra
0.57
but
0.56
But
0.55
arşivlendi
0.55
umpe
0.52
вовсе
0.50
אלא
0.49
haf
0.49
だが
0.49
Activations Density 0.031%