INDEX
Explanations
the word "single" preceded by "every" in the text
the phrase "every single."
Negative Logits
olas        -0.74
orthy       -0.74
eln         -0.71
arty        -0.69
ORK         -0.68
ersive      -0.68
umbledore   -0.66
ual         -0.66
uay         -0.65
exha        -0.65
Positive Logits
THING       1.12
digits      0.90
digit       0.88
WHERE       0.83
goddamn     0.79
imaginable  0.78
dollar      0.75
thing       0.74
body        0.73
person      0.72
Activations Density 0.012%