INDEX
Negative Logits
nearest
-0.69
enthusi
-0.67
populated
-0.65
newcom
-0.64
redes
-0.63
cancellation
-0.63
Cance
-0.62
exha
-0.61
Published
-0.61
princ
-0.61
POSITIVE LOGITS
't
1.84
ÃŃ
1.09
uts
1.00
eness
0.96
n
0.96
´
0.94
etsk
0.92
ned
0.92
hips
0.87
ALD
0.84
Activations Density 0.153%