Explanations
words related to specific dates or times
temporal or conditional phrases
Negative Logits
shedding        -0.70
slowing         -0.70
shock           -0.69
invasion        -0.68
infiltration    -0.67
rental          -0.67
stride          -0.66
envy            -0.66
prone           -0.66
infiltr         -0.66
Positive Logits
Else            1.06
Us              0.92
Things          0.86
Measure         0.84
Mine            0.83
Lives           0.83
Places          0.83
Different       0.83
Own             0.81
humans          0.80
Activations Density 0.170%