INDEX
Explanations
references to a specific company or brand
Negative Logits
Token               Logit
a                   -1.10
I                   -1.06
S                   -1.00
de                  -0.98
in                  -0.96
to                  -0.96
(no visible text)   -0.96
C                   -0.95
N                   -0.94
,                   -0.94
Positive Logits
Token               Logit
Shakspeare          1.57
itſelf              1.55
་་                  1.54
Jefus               1.49
Reſ                 1.49
Anſ                 1.47
Diſ                 1.47
ſelf                1.46
Majefty             1.45
doubtnut            1.44
Activations Density: 0.217%