INDEX
Explanations
dates expressed in the format of a two-digit year followed by a two-digit number
instances of the number "20" in various contexts
New Auto-Interp
Negative Logits
plom
-0.74
pora
-0.73
hei
-0.71
afort
-0.70
ño
-0.70
ablo
-0.65
ãĥ£
-0.64
osal
-0.64
henko
-0.63
kell
-0.62
POSITIVE LOGITS
âĸĪâĸĪ
1.05
committee
0.89
skirts
0.88
th
0.86
200000
0.84
ISH
0.83
61
0.79
%"
0.79
40
0.78
isher
0.78
Activations Density 0.060%