INDEX
Explanations
references to competitions and awards
Negative Logits
Token    Logit
ilim     -0.15
hey      -0.15
ogany    -0.14
pedia    -0.14
agens    -0.14
alink    -0.14
iani     -0.13
γον      -0.13
cctor    -0.13
REFER    -0.13
Positive Logits
Token    Logit
winner   0.28
winners  0.28
selected 0.26
winner   0.23
selected 0.23
chosen   0.22
select   0.22
selects  0.21
winning  0.21
vict     0.21
Activations Density 0.092%