INDEX
Explanations
references to smoking products, specifically cigarettes and cigars
New Auto-Interp
Negative Logits
wner
-0.16
emer
-0.15
Mort
-0.15
side
-0.15
entar
-0.14
mort
-0.14
_parms
-0.14
lov
-0.14
robat
-0.14
oha
-0.14
POSITIVE LOGITS
Bernstein
0.16
_allowed
0.14
/cop
0.14
iesel
0.14
Duc
0.14
awan
0.14
ovic
0.14
VENTORY
0.14
awa
0.14
icians
0.14
Activations Density 0.006%