INDEX
Explanations
terms and phrases related to scientific concepts and research
New Auto-Interp
Negative Logits
/−
-0.44
Dwyer
-0.43
others
-0.43
resourceCulture
-0.43
LLOW
-0.42
outgoing
-0.42
Garza
-0.41
ppas
-0.41
—–
-0.41
Jordan
-0.41
POSITIVE LOGITS
science
1.45
Science
1.39
science
1.38
SCIENCE
1.37
Science
1.34
scientific
1.22
Scientific
1.21
SCIENCE
1.20
Scientific
1.20
scientific
1.18
Activations Density 0.060%