INDEX
Explanations
mentions of different types of headwear
Negative Logits
otten     -0.17
TECTED    -0.17
erap      -0.17
esser     -0.15
evin      -0.15
ÛĮÙĨÚ©    -0.15
ifr       -0.15
inalg     -0.15
ingers    -0.15
vou       -0.15
Positive Logits
/head     0.16
-head     0.16
owel      0.15
andi      0.15
Head      0.14
Head      0.14
izons     0.14
Lid       0.14
sey       0.14
帽        0.14
Activations Density 0.026%