Explanations
phrases associated with health risks and medical contexts
Weight loss, dieting, or interventions
weight loss and diet choices
Negative Logits
Token            Logit
ViewFeatures     -0.59
виправивши       -0.56
pinball          -0.54
Personendaten    -0.54
DoubleQuotes     -0.54
Савезне          -0.53
########.        -0.53
chó              -0.51
spaceBetween     -0.50
oco              -0.50
Positive Logits
Token            Logit
weight           1.54
Weight           1.44
dieting          1.34
Weight           1.32
diet             1.31
WEIGHT           1.29
weight           1.25
slimming         1.15
WEIGHT           1.12
diet             1.11
Activations Density: 0.213%