INDEX
Explanations
phrases related to continuous or ongoing action
the word "doing" in various contexts
New Auto-Interp
Negative Logits
lights
-0.69
Tier
-0.69
mare
-0.67
mares
-0.66
)=(
-0.66
flags
-0.65
liner
-0.65
pter
-0.64
case
-0.63
sg
-0.63
POSITIVE LOGITS
berman
0.80
omething
0.79
pez
0.77
nothing
0.76
something
0.74
ggy
0.73
VIDIA
0.73
女
0.73
terribly
0.71
lyak
0.71
Activations Density 0.036%