INDEX
Explanations
phrases indicating the occurrence or announcement of events or situations
New Auto-Interp
Negative Logits
iets
-0.16
ensen
-0.15
ayer
-0.15
ritch
-0.14
521
-0.14
\Active
-0.13
.nextSibling
-0.13
cg
-0.13
jmu
-0.13
thereafter
-0.13
POSITIVE LOGITS
follows
0.28
heels
0.27
follow
0.23
hot
0.22
follow
0.20
heals
0.20
hot
0.20
Follow
0.18
heel
0.18
FOLLOW
0.18
Activations Density 0.020%