Explanations
terms related to resistance in various contexts
Negative Logits
Token                   Logit
the                     -0.55
al                      -0.52
ichen                   -0.52
mar                     -0.51
ate                     -0.50
he                      -0.50
↵↵                      -0.50
majority                -0.50
bater                   -0.50
(token text missing)    -0.50
Positive Logits
Token       Logit
Efq         1.43
itſelf      1.32
Jefus       1.30
myſelf      1.24
Majefty     1.18
ſelf        1.18
Theſe       1.13
poffible    1.13
auffi       1.12
purpoſe     1.12
Activations Density 0.104%