INDEX
Explanations
proper nouns related to educational institutions
mentions of specific educational institutions, particularly Cornell and Temple Universities
New Auto-Interp
Negative Logits
ipper
-0.80
hire
-0.79
entimes
-0.75
odic
-0.74
orter
-0.73
iasm
-0.73
ombies
-0.71
glomer
-0.69
arbon
-0.69
ortal
-0.69
POSITIVE LOGITS
Cornell
0.83
Haas
0.79
Milk
0.74
ighton
0.72
Davis
0.69
Triangle
0.67
McGill
0.67
ANCE
0.67
fing
0.67
Heights
0.66
Activations Density 0.024%