INDEX
Explanations
references to standardized testing and educational assessments
New Auto-Interp
Negative Logits
.edu
-0.19
undergraduate
-0.19
undergrad
-0.18
.GroupLayout
-0.17
antro
-0.16
MBA
-0.16
ip
-0.15
graduate
-0.15
university
-0.15
.wikipedia
-0.15
POSITIVE LOGITS
IB
0.23
AP
0.23
teens
0.19
high
0.19
High
0.19
SAT
0.18
HS
0.17
Teens
0.17
.AP
0.17
Rig
0.17
Activations Density 0.184%