INDEX
Explanations
phrases indicating an action or directive
occurrences of the word "To" in various contexts
New Auto-Interp
Negative Logits
nets
-0.73
hops
-0.66
drops
-0.64
pins
-0.63
soDeliveryDate
-0.63
dips
-0.61
wipes
-0.61
itates
-0.61
taps
-0.60
diapers
-0.60
POSITIVE LOGITS
ilet
1.23
summarize
1.01
oru
0.95
ffee
0.95
reiterate
0.94
asty
0.92
pping
0.90
clarify
0.89
othy
0.87
ppings
0.86
Activations Density 0.067%