INDEX
Explanations
phrases or sentences in quotes within parentheses
instances of opening parentheses or quotation marks in the text
New Auto-Interp
Negative Logits
)'
-0.62
?'
-0.61
,'
-0.54
shocking
-0.54
TPPStreamerBot
-0.54
gard
-0.52
auga
-0.52
manag
-0.51
.'
-0.50
Bengal
-0.50
POSITIVE LOGITS
("3.51
["
2.05
('2.05
("1.75
/"
1.72
—"
1.58
(=
1.56
(#
1.46
([
1.44
($
1.41
Activations Density 0.013%