INDEX
Explanations
phrases indicating a specific action or purpose
phrases indicating purpose or intention
New Auto-Interp
Negative Logits
Appears
-0.83
listed
-0.80
heavy
-0.69
done
-0.68
auga
-0.67
lime
-0.64
Ĭ
-0.63
Required
-0.63
checked
-0.63
Needs
-0.63
POSITIVE LOGITS
maximize
1.16
fulfill
1.15
satisfy
1.09
achieve
1.08
facilitate
1.07
promote
1.05
minimize
1.05
create
1.03
compensate
1.03
avoid
1.02
Activations Density 0.066%