Explanations
references to spans of time, particularly in number form
numerical values representing quantities or durations
Negative Logits
abbre       -0.66
Caption     -0.65
moderator   -0.64
rike        -0.64
turnout     -0.61
ashtra      -0.60
Ahead       -0.58
encer       -0.57
imaru       -0.56
llor        -0.55
Positive Logits
years         1.11
weeks         1.07
months        1.05
generations   1.02
decades       0.99
consecutive   0.98
days          0.98
months        0.97
minutes       0.97
hours         0.96
Activations Density: 0.180%