INDEX
Explanations
phrases related to competition or victory in different contexts
Negative Logits
Token     Logit
Ùĩ        -0.18
erable    -0.16
ripper    -0.16
erate     -0.15
osate     -0.14
icari     -0.14
vá        -0.14
hoot      -0.14
gado      -0.14
781       -0.14
Positive Logits
Token     Logit
rice      0.19
een       0.16
nik       0.16
iful      0.16
AGMA      0.15
ichen     0.15
åĦ¿       0.15
rol       0.14
elm       0.14
_PIPE     0.14
Activations Density: 0.027%