INDEX
Negative Logits
elcome
0.57
chromosome
0.55
Words
0.55
cata
0.55
าท
0.53
EPC
0.53
Richards
0.53
Electric
0.52
पाण्याची
0.52
Welcome
0.52
POSITIVE LOGITS
მან
0.54
led
0.54
fed
0.54
pol
0.53
rate
0.53
danger
0.52
CAUSED
0.50
mùi
0.49
seemed
0.49
hinge
0.48
Activations Density 0.000%