INDEX
Explanations
phrases related to actions that users can perform
the word "can" and its variants in different contexts within the text
New Auto-Interp
Negative Logits
ONSORED
-0.68
Mant
-0.68
soDeliveryDate
-0.66
soType
-0.61
Execution
-0.61
advertising
-0.60
pupil
-0.60
pants
-0.59
killing
-0.58
Ax
-0.57
POSITIVE LOGITS
't
1.33
choose
1.06
easily
1.01
afford
1.00
decide
0.99
adian
0.99
attest
0.99
participate
0.98
learn
0.96
NOT
0.95
Activations Density 0.173%