INDEX
Explanations
mention any form of visual content description or caption
bullet points or lists
New Auto-Interp
Negative Logits
son
-0.64
hift
-0.63
bda
-0.61
blackout
-0.60
creen
-0.58
bearer
-0.58
laund
-0.58
mans
-0.57
hal
-0.57
artificially
-0.56
POSITIVE LOGITS
CONTIN
0.82
=-
0.79
+---
0.76
======
0.75
WATCHED
0.73
--------------------------------------------------------
0.72
oiler
0.72
Chapters
0.71
--------------------
0.71
=~=~
0.70
Activations Density 0.064%