INDEX
Negative Logits
Sandy
-0.08
Say
-0.08
sap
-0.08
Say
-0.07
co
-0.07
wachten
-0.07
ج
-0.07
advert
-0.07
Yo
-0.07
YO
-0.07
POSITIVE LOGITS
nervous
0.11
�
0.09
elm
0.09
Kingston
0.08
most
0.08
तः
0.08
fem
0.07
hollywood
0.07
gir
0.07
artery
0.07
Activations Density 0.011%