INDEX
Negative Logits
ILE
-1.64
liders
-0.73
ILES
-0.63
ilet
-0.59
emark
-0.52
ele
-0.51
bat
-0.50
san
-0.50
лет
-0.49
كويكب
-0.49
POSITIVE LOGITS
Efq
0.78
themſelves
0.77
Jefus
0.77
reafon
0.73
Monfieur
0.73
Shakspeare
0.71
itſelf
0.70
reaſon
0.69
pleaſure
0.68
myſelf
0.68
Activations Density 1.695%