INDEX
Explanations
references to historical events and changes in society
New Auto-Interp
Negative Logits
Yesterday
-0.17
recently
-0.16
Yesterday
-0.14
eldom
-0.14
utures
-0.14
Recently
-0.14
minul
-0.14
æĺ¨
-0.13
rix
-0.13
oned
-0.13
POSITIVE LOGITS
until
0.27
until
0.23
gradually
0.23
during
0.22
Until
0.22
Until
0.21
Beginning
0.21
beginning
0.20
Grad
0.20
Beginning
0.20
Activations Density 0.208%