INDEX
NEGATIVE LOGITS
AddTagHelper
-0.76
findpost
-0.60
poussière
-0.58
taxia
-0.58
tangentMode
-0.57
тельству
-0.56
preuves
-0.56
TagMode
-0.55
guruan
-0.55
Приступљено
-0.55
POSITIVE LOGITS
account
0.58
variations
0.52
individual
0.52
changes
0.50
the
0.48
rocking
0.47
body
0.47
BorderRadius
0.47
}{*}{
0.46
personal
0.46
Activations Density 0.002%