INDEX
Explanations
references to the passage of time or historical duration
New Auto-Interp
Negative Logits
ervo
-0.15
adow
-0.15
Äįné
-0.15
Hour
-0.14
hurst
-0.14
nelly
-0.14
illy
-0.13
oken
-0.13
dope
-0.13
icas
-0.13
POSITIVE LOGITS
years
0.54
ages
0.50
Ages
0.40
Years
0.40
years
0.40
Years
0.40
decades
0.38
YEARS
0.37
ages
0.35
yrs
0.35
Activations Density 0.085%