INDEX
Negative Logits
髀
0.33
duur
0.32
रक्तदान
0.31
coinage
0.31
اشت
0.31
potentialities
0.31
sembl
0.30
revistas
0.30
लेखा
0.30
diversión
0.30
POSITIVE LOGITS
apologize
0.77
apologise
0.66
recommend
0.63
hope
0.58
purposely
0.55
noticed
0.54
apologies
0.54
purposefully
0.54
understand
0.52
believe
0.52
Activations Density 0.169%