INDEX
Explanations
temporal indicators related to events or actions
Negative Logits
Efq           -0.90
Cæsar         -0.86
himſelf       -0.80
Majefty       -0.78
Jefus         -0.77
ſelf          -0.75
purpoſe       -0.71
ſtate         -0.71
Monfieur      -0.71
Shakspeare    -0.70
Positive Logits
the           1.40
they          1.29
we            1.20
it            1.09
you           1.02
there         0.93
he            0.93
a             0.86
an            0.78
              0.72
Activations Density 0.151%