INDEX
Explanations
mentions of awards, particularly in the context of literary or artistic achievements
New Auto-Interp
Negative Logits
asks
-0.18
amarin
-0.17
fang
-0.16
нÑĸм
-0.15
aley
-0.15
posables
-0.15
unkt
-0.15
ãĤ¤ãĥī
-0.15
ues
-0.15
aksi
-0.15
POSITIVE LOGITS
-winning
0.30
winning
0.24
awarded
0.20
award
0.19
borg
0.18
win
0.17
award
0.17
won
0.16
red
0.16
-w
0.16
Activations Density 0.012%