INDEX
Explanations
statements related to research findings or studies
discussions about scientific studies and their conclusions
New Auto-Interp
Negative Logits
orld
-0.74
tesy
-0.74
ruary
-0.74
@#
-0.73
yssey
-0.73
çīĪ
-0.73
=-=-
-0.73
\\\\\\\\
-0.73
Tokens
-0.72
nesday
-0.71
POSITIVE LOGITS
obesity
1.23
antidepressant
1.14
antibiotic
1.12
contraceptive
1.10
antidepressants
1.09
HPV
1.06
genetically
1.05
diets
1.05
fertility
1.05
cancers
1.04
Activations Density 0.756%