INDEX
Explanations
time-related phrases, specifically mentioning durations such as "months" or "years"
references to time periods, specifically those that describe events occurring in the recent past
Negative Logits
Token       Logit
Franch      -0.66
kaya        -0.66
aband       -0.65
olas        -0.65
\\\\\\\\    -0.65
/#          -0.65
ensical     -0.62
relative    -0.62
halla       -0.62
cil         -0.61
Positive Logits
Token       Logit
gasp        1.04
month       1.01
week        1.00
year        0.98
ditch       0.98
decade      0.96
ebin        0.91
night       0.87
supper      0.84
Gleaming    0.82
Activations Density 0.063%