INDEX
Explanations
time-related expressions, specifically referring to past periods
temporal expressions related to durations, particularly years and months
New Auto-Interp
Negative Logits
inav
-0.76
emort
-0.74
hett
-0.71
uctions
-0.71
ashtra
-0.71
atche
-0.68
acted
-0.65
corridors
-0.63
estine
-0.63
ibaba
-0.61
POSITIVE LOGITS
ago
1.04
long
0.96
dozen
0.90
Ago
0.89
apiece
0.85
night
0.75
glass
0.73
dream
0.72
nd
0.68
eteenth
0.67
Activations Density 0.089%