INDEX
Explanations
instances of the word "possession."
mentions of possession-related offenses
New Auto-Interp
Negative Logits
htt
-0.70
MQ
-0.67
externalToEVAOnly
-0.64
mitt
-0.63
bank
-0.62
prep
-0.61
Redd
-0.61
resc
-0.61
ser
-0.61
marg
-0.60
POSITIVE LOGITS
ivity
1.01
possession
0.94
ership
0.93
ive
0.89
iveness
0.88
ibility
0.88
iture
0.86
ession
0.85
essor
0.85
alogue
0.84
Activations Density 0.036%