INDEX
Negative Logits
cientes
0.65
θεν
0.61
thankfully
0.60
ваи
0.60
પ્રમાણે
0.59
sichtbar
0.58
snart
0.57
ترام
0.57
okin
0.57
oling
0.56
POSITIVE LOGITS
have
1.49
having
1.47
Have
1.37
Have
1.32
having
1.32
avoir
1.29
being
1.23
Having
1.22
had
1.21
have
1.21
Activations Density 0.251%