INDEX
Explanations
references to the comic book character "The Punisher"
mentions of the character "Punisher"
New Auto-Interp
Negative Logits
ORGE
-0.79
MO
-0.78
externalActionCode
-0.78
ELD
-0.75
afety
-0.73
gow
-0.73
ALTH
-0.72
RAFT
-0.72
ickr
-0.69
AKING
-0.69
POSITIVE LOGITS
isher
1.00
pun
1.00
Pun
0.97
cheon
0.97
ishment
0.91
gments
0.86
ishable
0.83
cture
0.82
secut
0.81
kat
0.79
Activations Density 0.010%