INDEX
Explanations
mentions of specific colleges and universities
New Auto-Interp
Negative Logits
bler
-0.77
ulative
-0.73
lyak
-0.71
saf
-0.70
BIP
-0.70
rahim
-0.69
ilitation
-0.67
Agent
-0.65
glers
-0.65
href
-0.64
POSITIVE LOGITS
Lauder
0.89
neau
0.85
Tire
0.83
vale
0.80
Scotia
0.77
osaurus
0.76
gomery
0.75
Heights
0.75
aval
0.74
elly
0.73
Activations Density 0.011%