INDEX
Negative Logits
eworld
-0.07
차
-0.07
aso
-0.06
bitk
-0.06
endi
-0.06
nella
-0.06
.");↵
-0.06
obia
-0.06
usta
-0.06
figure
-0.06
POSITIVE LOGITS
Puppy
0.09
puppy
0.08
Murphy
0.07
startX
0.06
ynchronize
0.06
pup
0.06
stumble
0.06
puppies
0.06
separation
0.06
прик
0.06
Activations Density 0.002%