INDEX
Explanations
references to criminal activity or investigations related to political figures
New Auto-Interp
Negative Logits
CreateTagHelper
-1.35
doubtnut
-1.31
purpoſe
-1.25
Efq
-1.23
Majefty
-1.23
Shakspeare
-1.21
myſelf
-1.21
Jefus
-1.20
Anſ
-1.17
་་
-1.17
POSITIVE LOGITS
0.66
,
0.64
(
0.61
<eos>
0.61
in
0.58
:
0.57
N
0.56
A
0.56
να
0.54
.
0.54
Activations Density 0.152%