INDEX
Negative Logits
iffies
-0.16
es
-0.15
ÃŃcio
-0.15
atchet
-0.14
ei
-0.14
Grade
-0.14
aat
-0.14
enant
-0.14
edList
-0.14
/Error
-0.14
POSITIVE LOGITS
rosse
0.37
quer
0.31
oste
0.26
asse
0.21
ROS
0.19
moid
0.18
unar
0.18
Cros
0.17
uesta
0.17
nung
0.17
Activations Density 0.003%