INDEX
Explanations
phrases related to prestigious awards and honors
references to prestigious awards, particularly the Nobel Prize and Pulitzer Prize
New Auto-Interp
Negative Logits
icago
-0.78
aves
-0.72
yy
-0.69
addock
-0.68
Antar
-0.67
alach
-0.67
andering
-0.66
oute
-0.65
ECTION
-0.63
mson
-0.63
POSITIVE LOGITS
Prize
1.37
laureate
1.18
Nobel
1.03
laure
1.01
prize
0.94
Laure
0.92
Nob
0.91
prizes
0.80
Pri
0.80
Peace
0.79
Activations Density 0.009%