Explanations
mentions of prestigious awards, accolades, or institutions
instances of the word "prestigious."
Negative Logits
Token      Logit
ghan       -0.68
sil        -0.66
irez       -0.65
Twe        -0.65
creator    -0.65
plant      -0.64
activated  -0.64
uber       -0.64
harm       -0.64
Radio      -0.63
Positive Logits
Token         Logit
accol         0.94
awards        0.91
cffff         0.88
prestigious   0.85
prizes        0.85
honors        0.83
coveted       0.78
award         0.78
prest         0.77
endorsements  0.74
Activations Density: 0.034%