INDEX
Explanations
time-related phrases
references to the passage of time, specifically in terms of days and years
New Auto-Interp
Negative Logits
XY
-0.74
VID
-0.70
acted
-0.69
alore
-0.65
onis
-0.65
ACTED
-0.65
ERAL
-0.60
ãĥ´ãĤ¡
-0.60
exception
-0.59
allic
-0.59
POSITIVE LOGITS
chool
1.05
creen
1.02
hift
0.93
pring
0.91
ago
0.87
cript
0.84
iblings
0.84
ayers
0.84
mith
0.80
peed
0.78
Activations Density 0.138%