INDEX
Explanations
phrases indicating the beginning or starting of something
phrases indicating the initiation or progression of events or conditions
Negative Logits
oversaw        -0.70
Palest         -0.65
CI             -0.65
avoided        -0.64
done           -0.64
cares          -0.63
keeping        -0.62
supervised     -0.61
congratulated  -0.60
didn           -0.60
Positive Logits
crumble        1.30
emerge         1.24
unfold         1.22
fade           1.20
explode        1.13
dawn           1.12
trickle        1.11
sink           1.11
dissolve       1.08
fray           1.08
Activations Density 0.162%