INDEX
Explanations
references to nicotine and vaping products
Negative Logits
getSingleton    -0.16
adin            -0.15
dit             -0.15
ansa            -0.14
intage          -0.14
vez             -0.14
obl             -0.14
antis           -0.14
rong            -0.14
Canter          -0.14
Positive Logits
ünd             0.15
Grü             0.14
rings           0.14
undefeated      0.13
Freund          0.13
flip            0.13
teen            0.13
FLT             0.13
caff            0.13
hoÃłn           0.13
Activations Density: 0.004%