INDEX
Explanations
instances of the word "you" in various contexts
New Auto-Interp
Negative Logits
^(@)
-1.56
myſelf
-1.50
Efq
-1.49
betweenstory
-1.48
BibitemShut
-1.47
Monfieur
-1.43
bibfield
-1.39
themſelves
-1.34
Houſe
-1.33
itſelf
-1.31
POSITIVE LOGITS
'
0.93
)
0.92
),
0.92
,
0.91
-
0.88
...
0.85
.
0.83
↵↵
0.82
.,
0.82
?
0.82
Activations Density 0.346%