INDEX
Explanations
references to fried food, particularly fried chicken
New Auto-Interp
Negative Logits
PLIED
-0.16
ีย
-0.15
lify
-0.15
понÑıÑĤÑĮ
-0.15
grily
-0.15
ivar
-0.14
ISED
-0.14
åı·
-0.14
plates
-0.14
aides
-0.14
POSITIVE LOGITS
ricks
0.21
¼
0.16
emma
0.15
صÙĨ
0.15
essen
0.15
son
0.14
enheim
0.14
anner
0.14
arton
0.14
ying
0.14
Activations Density 0.017%