INDEX
Explanations
phrases related to direct actions or directives
the term "Direct" and its variations, indicating a focus on direct content or services
Negative Logits
OAD       -0.86
Schne     -0.77
anners    -0.71
Haram     -0.71
FORE      -0.69
EAR       -0.68
glers     -0.68
Kaufman   -0.66
Vance     -0.65
OPLE      -0.65
Positive Logits
ions      1.04
iate      1.02
ives      0.95
edly      0.92
ories     0.89
eur       0.84
enture    0.83
lined     0.82
irect     0.80
ing       0.80
Activations Density 0.018%
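
To put the density figure in concrete terms, here is a minimal sketch that assumes "activation density" means the fraction of sampled tokens on which the feature fires (activation above zero). The function name, the threshold parameter, and the sample data are all hypothetical and used only for illustration; the page itself does not say how the value was computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: density is counted per token over a sample of activations.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 18 of 100,000 tokens.
sample = [0.0] * 99_982 + [1.5] * 18
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.018%
```

Under that reading, a density of 0.018% corresponds to roughly 18 active tokens per 100,000, which is a very sparse feature.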