INDEX
Explanations
references to awards, nominations, and achievements
references to award categories and their respective nominations or wins
New Auto-Interp
Negative Logits
gypt
-0.76
probing
-0.68
utherford
-0.67
ptin
-0.65
coli
-0.63
bypass
-0.63
awake
-0.62
peror
-0.62
Juda
-0.62
manent
-0.61
POSITIVE LOGITS
Best
1.09
seller
1.06
Worst
1.01
Best
1.00
sell
0.93
iary
0.91
worst
0.91
Winner
0.87
iaries
0.83
hest
0.81
Activations Density 0.009%