INDEX
Explanations
words related to the concept of movement or progression
the word "go" in various contexts
New Auto-Interp
Negative Logits
accur
-0.65
ammon
-0.62
Aram
-0.62
inh
-0.62
accurately
-0.61
foll
-0.59
elbows
-0.59
creen
-0.58
densely
-0.58
race
-0.58
POSITIVE LOGITS
vernment
0.98
verning
0.95
ardless
0.89
eker
0.87
ogly
0.85
eto
0.84
eff
0.84
ven
0.83
xit
0.82
ption
0.81
Activations Density 0.012%