INDEX
NEGATIVE LOGITS
保護          -0.07
vae         -0.07
Belediye    -0.07
bir         -0.07
stru        -0.07
Cumhur      -0.06
Parks       -0.06
_DIP        -0.06
Strawberry  -0.06
kolo        -0.06
POSITIVE LOGITS
Baltimore   0.07
_exact      0.06
Final       0.06
fwd         0.06
genre       0.06
card        0.06
FINAL       0.06
ngr         0.06
org         0.06
Joined      0.06
Activations Density: 0.007%