INDEX
Explanations
specific time periods or durations
phrases indicating time durations or periods of employment
New Auto-Interp
Negative Logits
pour
-0.74
certain
-0.72
hazard
-0.67
Maker
-0.67
cript
-0.67
aque
-0.67
sufficient
-0.66
VP
-0.66
eny
-0.66
sure
-0.65
POSITIVE LOGITS
inka
0.67
terday
0.67
ithub
0.66
itures
0.65
ishers
0.65
proven
0.64
alde
0.63
unda
0.63
Til
0.63
soever
0.63
Activations Density 0.028%