INDEX
Explanations
mentions of "Bart" and variations of the word "art."
New Auto-Interp
Negative Logits
ÏĥÏī
-0.15
ock
-0.15
otherapy
-0.14
اعÙĬ
-0.14
kuk
-0.14
ignet
-0.14
eval
-0.14
osen
-0.14
ÑĥлÑı
-0.14
itz
-0.14
POSITIVE LOGITS
AsStream
0.16
alars
0.15
ecast
0.15
urum
0.15
ereotype
0.15
opleft
0.15
abra
0.14
otherwise
0.14
edly
0.14
еÑĤа
0.14
Activations Density 0.004%