INDEX
Explanations
Instances of the word "recommend" and its variants, indicating a focus on suggestions or endorsements.
Negative Logits
quin        -0.17
بار         -0.17
zeug        -0.16
awan        -0.15
arde        -0.15
Blasio      -0.15
chers       -0.15
uf          -0.15
istically   -0.14
aps         -0.14
Positive Logits
strongly    0.27
/request    0.26
against     0.21
ively       0.21
ìĤ¬íķŃ      0.19
atory       0.19
entially    0.19
/prom       0.18
ive         0.18
highly      0.18
Activations Density: 0.069%