Explanations
references to periods of time, particularly years and months
Negative Logits
Token      Logit
atif       -0.17
odus       -0.15
ailable    -0.14
ADA        -0.14
anus       -0.14
byn        -0.14
assis      -0.14
\`         -0.14
elman      -0.14
Scalars    -0.14
Positive Logits
Token      Logit
ago        0.53
ago        0.40
Ago        0.38
ego        0.28
AGO        0.28
back       0.28
go         0.26
age        0.25
назад      0.24
go         0.22
Activations Density: 0.027%