Explanations
verbs related to actions or events happening sequentially
Negative Logits
voy       -0.76
vil       -0.74
utor      -0.71
ldom      -0.71
flo       -0.69
adin      -0.69
tu        -0.69
uci       -0.68
ukong     -0.67
ancing    -0.66
Positive Logits
suit          1.08
closely       0.94
suit          0.88
ĸļ            0.84
footsteps     0.73
logically     0.70
faithfully    0.66
Suit          0.65
SHIP          0.65
:]            0.64
Activations Density: 0.032%