Explanations
alcohol consumption and abuse
NEGATIVE LOGITS
maline        0.37
Thickness     0.36
ilience       0.36
Particle      0.36
ρι            0.35
नियो          0.35
Futuristic    0.35
orthodont     0.35
uraea         0.35
龈            0.35
POSITIVE LOGITS
alcohol       3.13
Alcohol       2.84
Alcohol       2.83
alkohol       2.81
alcohol       2.80
алкого        2.73
alcoholic     2.70
Alkohol       2.63
alcool        2.56
alkoh         2.55
Activations Density: 0.034%