INDEX
Explanations
phrases related to prestigious awards such as Nobel Prize and Pulitzer Prize
mentions of prestigious awards, particularly the Nobel and Pulitzer Prizes
New Auto-Interp
Negative Logits
addock
-0.73
icago
-0.72
alach
-0.71
aves
-0.71
aved
-0.67
andering
-0.66
Reyn
-0.65
afort
-0.65
\-
-0.65
othy
-0.64
POSITIVE LOGITS
Prize
1.20
laureate
1.13
laure
1.13
Nobel
1.12
Laure
1.00
Nob
0.92
prize
0.78
medal
0.77
Lect
0.76
physicist
0.74
Activations Density 0.006%