INDEX
Explanations
words related to time, particularly references to the upcoming year
references to time periods, particularly the upcoming year
New Auto-Interp
Negative Logits
gotten
-0.91
esse
-0.69
uese
-0.66
ById
-0.65
nered
-0.64
76561
-0.63
ocker
-0.62
apsed
-0.60
anecd
-0.59
Dys
-0.59
POSITIVE LOGITS
onwards
0.70
ruary
0.69
lins
0.68
mie
0.67
fw
0.66
Learn
0.66
2019
0.65
iatus
0.65
å§«
0.64
cture
0.62
Activations Density 0.039%