INDEX
Negative Logits
-ahụ
-0.08
mily
-0.08
nagh
-0.08
grime
-0.08
mildew
-0.08
删
-0.08
june
-0.08
hollywood
-0.08
污
-0.08
alcoholism
-0.08
POSITIVE LOGITS
meals
0.09
effect
0.08
fencing
0.08
folding
0.08
Effect
0.08
repas
0.08
overhead
0.08
'effet
0.07
Yoga
0.07
Ko
0.07
Activations Density 0.002%