INDEX
Explanations
instances of the word "rare."
New Auto-Interp
Negative Logits
fast
-0.58
a
-0.58
ty
-0.55
fle
-0.55
o
-0.54
and
-0.53
High
-0.53
-
-0.52
com
-0.51
-0.50
POSITIVE LOGITS
Majefty
1.25
Anſ
1.16
Jefus
1.10
purpoſe
1.10
ſelves
1.09
greateſt
1.09
itſelf
1.07
Efq
1.04
neceff
1.04
pleaſure
1.03
Activations Density 0.151%