INDEX
Explanations
phrases indicating a duration of time or a waiting period
phrases indicating duration and ongoing actions
New Auto-Interp
Negative Logits
Secondly
-0.73
secondly
-0.72
lihood
-0.70
Shutterstock
-0.68
later
-0.67
tomorrow
-0.67
[+
-0.65
iscons
-0.64
*/(
-0.63
similarity
-0.63
POSITIVE LOGITS
maintained
0.91
waged
0.91
resided
0.88
tirelessly
0.88
uninterrupted
0.88
dogged
0.87
relentlessly
0.87
altern
0.86
steadfast
0.85
steadily
0.85
Activations Density 0.354%