INDEX
Explanations
mentions of a person evading or dealing with situations involving conflict or consequences
New Auto-Interp
Negative Logits
)=(
-0.73
Methods
-0.67
Reviewer
-0.66
Dispatch
-0.64
Handling
-0.64
ONSORED
-0.63
Prosecutor
-0.61
rification
-0.60
DOC
-0.59
mus
-0.59
POSITIVE LOGITS
omsday
1.37
ppel
1.29
herty
1.25
gging
1.09
lez
1.07
ctors
1.05
ctr
1.04
berman
1.03
ozy
1.02
ggie
1.01
Activations Density 0.048%