INDEX
Explanations
mentions of months, particularly their names and related numerical representations
New Auto-Interp
Negative Logits
ToWorld
-0.16
û
-0.15
imus
-0.15
Ñıж
-0.14
paque
-0.14
agna
-0.14
infeld
-0.14
alph
-0.14
upo
-0.14
érica
-0.14
POSITIVE LOGITS
nox
0.15
lich
0.15
uma
0.14
arnation
0.14
Ro
0.14
double
0.14
erras
0.13
_ast
0.13
ams
0.13
ocol
0.13
Activations Density 0.014%