Explanations
names of prestigious universities
references to Yale University and related institutions
Negative Logits
Token      Logit
leased     -0.66
rive       -0.65
ãĥĩãĤ£     -0.65
ossible    -0.64
omal       -0.63
akable     -0.62
gotten     -0.62
otent      -0.61
asta       -0.61
phabet     -0.61
Positive Logits
Token       Logit
University  1.00
uates       0.94
classmate   0.84
Haram       0.83
alumni      0.82
graduates   0.80
Yale        0.78
undergrad   0.77
faculty     0.77
Chapel      0.75
Activations Density: 0.008%