INDEX
Explanations
references to historical time periods
Negative Logits
(token not shown)   -0.66
a                   -0.59
T                   -0.59
the                 -0.58
two                 -0.58
K                   -0.56
P                   -0.56
ŝ                   -0.56
U                   -0.56
Z                   -0.55
Positive Logits
Monfieur            1.17
decade              1.14
+#+                 1.13
Majefty             1.11
myſelf              1.07
'\\;'               1.05
expandindo          1.01
Decade              1.00
―――――               1.00
itſelf              0.99
Activations Density 0.079%