NEGATIVE LOGITS
Token             Logit
ArrowToggle       -0.58
Inactive          -0.54
onAnimation       -0.54
vastaan           -0.54
conseguenze       -0.53
nemici            -0.53
ίων               -0.53
veille            -0.52
translates        -0.52
abestanden        -0.51

POSITIVE LOGITS
Token             Logit
tartalomajánló    0.61
+),               0.58
")));             0.54
"):               0.53
"];               0.51
)';               0.50
виправивши        0.49
الحره             0.48
]";               0.48
'):               0.48
Activations Density: 0.152%