INDEX
Explanations
tall and slender figure or on his head
New Auto-Interp
Negative Logits
twink
-0.10
çĶ·æĢ§
-0.10
elderly
-0.10
yaÅŁlı
-0.09
lep
-0.09
Adult
-0.09
men
-0.09
leer
-0.09
empt
-0.09
purple
-0.09
POSITIVE LOGITS
fre
0.13
viv
0.12
Wir
0.11
girl
0.10
tom
0.10
ç¬ij
0.10
radi
0.10
andid
0.10
hoy
0.09
laugh
0.09
Activations Density 0.101%