INDEX
Explanations
phrases indicating actions or decisions being taken
occurrences of the verb "do" in various contexts
New Auto-Interp
Negative Logits
mares
-0.65
Frie
-0.57
ascended
-0.56
Rank
-0.56
lights
-0.55
ulative
-0.54
Hung
-0.53
uses
-0.53
hung
-0.51
hog
-0.51
POSITIVE LOGITS
oms
1.00
ppel
1.00
omsday
0.96
oming
0.91
lez
0.86
vet
0.84
xx
0.82
likewise
0.81
nothing
0.81
justice
0.80
Activations Density 0.140%