INDEX
Explanations
phrases indicating goals or aims
phrases indicating objectives or purposes
New Auto-Interp
Negative Logits
Appears
-0.69
Torn
-0.69
faces
-0.66
hot
-0.64
note
-0.63
declarations
-0.63
guards
-0.63
Used
-0.63
Kop
-0.61
Lemon
-0.61
POSITIVE LOGITS
maximize
1.14
conserve
1.06
emulate
1.06
stimulate
1.05
minimize
1.04
preserve
1.04
promote
1.04
provide
1.03
achieve
1.01
eliminate
1.01
Activations Density 0.171%