INDEX
Explanations
mentions of the term 'Murd*er' with different variations, along with some related terms like 'Daredevil'
occurrences of the word "Murderer" and references to the show "Daredevil."
New Auto-Interp
Negative Logits
conditioning
-0.71
placement
-0.71
Gemini
-0.67
corrective
-0.65
magnification
-0.64
zed
-0.62
condition
-0.62
selective
-0.62
eq
-0.62
graded
-0.62
POSITIVE LOGITS
Murd
1.37
stead
0.91
erer
0.90
erers
0.89
erest
0.81
aby
0.80
aredevil
0.78
anth
0.77
luaj
0.77
ings
0.76
Activations Density 0.013%