INDEX
Explanations
the beginning of a document or section header
New Auto-Interp
Negative Logits
TagMode
-0.98
pleaſure
-0.95
houſe
-0.94
TextAppearance
-0.94
ſtate
-0.94
fubject
-0.93
raiſ
-0.90
itſelf
-0.90
purpoſe
-0.88
saraba
-0.88
POSITIVE LOGITS
0.73
“
0.59
a
0.57
"
0.55
<eos>
0.53
the
0.53
must
0.52
ha
0.45
an
0.45
もん
0.45
Activations Density 0.053%