INDEX
Explanations
time-related mentions with a focus on durations or sequences
New Auto-Interp
Negative Logits
Cosponsors
-0.66
irs
-0.66
uristic
-0.64
orthy
-0.64
Percent
-0.60
irlfriend
-0.59
oiler
-0.58
idad
-0.57
orporated
-0.56
azz
-0.56
POSITIVE LOGITS
researching
0.74
trial
0.73
conversation
0.72
production
0.72
succeeding
0.71
struction
0.69
totality
0.69
succession
0.67
Lent
0.67
conversations
0.66
Activations Density 10.468%