INDEX
Explanations
words related to tobacco use and its consequences
New Auto-Interp
Negative Logits
躇
-0.63
Abbey
-0.62
Gopal
-0.61
Lé
-0.59
Yeh
-0.58
Farrar
-0.58
Cambio
-0.56
Zag
-0.56
Gust
-0.55
therlands
-0.55
POSITIVE LOGITS
cigarettes
0.95
tobacco
0.93
toilet
0.88
snee
0.81
chimney
0.79
bathroom
0.78
Toilet
0.77
toilet
0.76
cigarette
0.75
sneezing
0.73
Activations Density 2.835%