INDEX
Explanations
references to control or dominance in interpersonal or market contexts
New Auto-Interp
Negative Logits
sville
-0.17
ÑĢÑĥн
-0.15
Alb
-0.15
acc
-0.15
ÑĢаÑĩ
-0.15
mue
-0.14
mdb
-0.14
QE
-0.14
使
-0.14
iams
-0.14
POSITIVE LOGITS
covered
0.38
Covered
0.34
covered
0.33
-covered
0.24
figured
0.24
COVER
0.21
licked
0.21
peg
0.20
Cover
0.20
down
0.18
Activations Density 0.095%