INDEX
Explanations
references to feelings of disillusionment or disappointment with leadership
Negative Logits
་་        -1.20
itſelf    -1.20
myſelf    -1.19
Majefty   -1.18
Jefus     -1.17
Efq       -1.17
Theſe     -1.16
ſind      -1.09
клопе     -1.07
―――――     -1.07
Positive Logits
↵↵        0.68
.         0.67
<h2>      0.66
)         0.66
),        0.65
).        0.63
$\        0.63
(         0.61
A         0.59
[no visible token]  0.59
Activations Density 0.356%