Explanations
words related to smoking
mentions and discussions surrounding smoking
Negative Logits
assian      -0.88
Vector      -0.74
ousse       -0.69
HCR         -0.69
yss         -0.68
oteric      -0.68
ensional    -0.67
UFC         -0.67
Offic       -0.66
ngth        -0.66
Positive Logits
cessation   1.39
smoking     1.10
cigarettes  1.10
smoker      1.08
smoked      1.02
smoke       0.96
habits      0.95
cigars      0.91
smokers     0.91
cig         0.90
Activations Density 0.016%