INDEX
Explanations
expressions of determination or commitment to take action
instances of the word "do" in various contexts
Negative Logits
uses      -0.63
mares     -0.61
Rank      -0.61
ゼウス    -0.58
antha     -0.58
liner     -0.57
lights    -0.57
CLS       -0.56
used      -0.56
being     -0.54
Positive Logits
oms        1.03
omsday     0.96
oming      0.96
xx         0.95
ctr        0.92
nothing    0.91
away       0.90
vet        0.90
lez        0.90
something  0.87
Activations Density: 0.113%