INDEX
Explanations
phrases indicating personal influence or power dynamics within organizations
New Auto-Interp
Negative Logits
Personendaten
-0.82
BeginContext
-0.79
EDEFAULT
-0.76
richTextPanel
-0.74
}}"></
-0.74
NUMX
-0.74
مشين
-0.73
.")
-0.73
ComVisible
-0.71
\\
-0.71
POSITIVE LOGITS
does
1.19
did
1.06
does
1.03
do
0.95
Does
0.92
Does
0.91
did
0.87
do
0.80
DOES
0.79
DOES
0.73
Activations Density 0.316%