INDEX
Explanations
verbs related to directing or adhering to instructions or guidelines
instances of the word "follow."
New Auto-Interp
Negative Logits
wounding
-0.78
orc
-0.66
ukemia
-0.66
pite
-0.63
ldom
-0.62
aucas
-0.62
adin
-0.61
intendent
-0.61
morph
-0.61
obyl
-0.60
POSITIVE LOGITS
follow
1.09
follow
1.04
follows
0.94
Follow
0.94
LLOW
0.81
followed
0.80
SHIP
0.77
ansen
0.72
ĸļ
0.69
Follow
0.67
Activations Density 0.012%