INDEX
Explanations
Information related to academic research papers, authors, and sources, particularly in economics, sociology, technology, and security.
Negative Logits
Token         Logit
pport         -0.66
planes        -0.61
stereotypes   -0.61
zers          -0.61
upside        -0.60
overhe        -0.59
antic         -0.58
stigma        -0.58
bucks         -0.57
ocent         -0.56
Positive Logits
Token         Logit
Jr            1.23
Sr            0.93
et            0.87
PhD           0.86
MD            0.79
Cosponsors    0.78
CFR           0.76
Es            0.76
Jr            0.74
Chair         0.74
Activations Density: 13.827%