INDEX
Explanations
words related to studies, research, and scientific findings
scientifically focused statements regarding health and well-being
New Auto-Interp
Negative Logits
continuity
-0.71
--------
-0.68
UNCLASSIFIED
-0.67
gencies
-0.66
verage
-0.65
ghazi
-0.65
":["
-0.64
cellaneous
-0.63
Lago
-0.63
thereafter
-0.63
POSITIVE LOGITS
Researchers
1.68
researchers
1.49
Researchers
1.45
Scientists
1.42
scientists
1.38
Scientists
1.34
scient
1.16
researcher
1.15
research
1.15
Redditor
1.14
Activations Density 0.487%