INDEX
Explanations
the word "risk" and the word "magazine," perhaps in different contexts
Negative Logits
Token          Logit
[not shown]    -1.28
<eos>          -1.28
the            -1.24
[not shown]    -1.22
a              -1.16
↵              -1.10
in             -1.09
↵↵             -1.05
(              -1.03
"              -1.03
Positive Logits
Token          Logit
Efq            2.52
Jefus          2.52
Monfieur       2.39
myſelf         2.34
purpoſe        2.28
Majefty        2.27
pleaſure       2.25
itſelf         2.25
houſe          2.19
iſt            2.17
Activations Density 1.715%