INDEX
Explanations
references to institutions or organizations, specifically focusing on identifying Harvard University
references to prestigious educational institutions, particularly Harvard
New Auto-Interp
Negative Logits
hers
-0.68
let
-0.65
headers
-0.63
lets
-0.61
while
-0.60
apses
-0.58
falls
-0.58
neut
-0.58
Ãĥ
-0.58
theirs
-0.57
POSITIVE LOGITS
phabet
0.76
Crimes
0.69
Deity
0.68
Offense
0.67
£ı
0.67
Evening
0.67
DragonMagazine
0.66
stanbul
0.65
iband
0.65
Reported
0.64
Activations Density 0.330%