INDEX
Explanations
words associated with instructions or directives
occurrences of the word "to."
New Auto-Interp
Negative Logits
listed
-0.66
soDeliveryDate
-0.65
hops
-0.64
pointers
-0.61
afety
-0.60
hur
-0.59
croft
-0.57
Got
-0.56
resy
-0.56
need
-0.56
POSITIVE LOGITS
pload
1.05
maximize
0.96
avoid
0.96
compensate
0.96
minimize
0.93
give
0.91
eliminate
0.90
keep
0.90
satisfy
0.89
fulfill
0.88
Activations Density 1.202%