Explanations
references to smoking and tobacco products
Negative Logits
Token         Logit
ingham        -0.17
jeme          -0.14
微软éĽħé»ij    -0.14
uite          -0.14
atorio        -0.14
cuis          -0.13
aptors        -0.13
Kv            -0.13
/msg          -0.13
caler         -0.13
Positive Logits
Token         Logit
cigarettes    0.56
tobacco       0.53
cigarette     0.52
Tobacco       0.45
smoking       0.44
smokers       0.43
nicotine      0.42
smoker        0.42
cig           0.41
cigar         0.40
Activations Density 0.041%