INDEX
Explanations
expressions related to bravery and courage
New Auto-Interp
Negative Logits
lue
-0.16
acro
-0.15
ãĥĨãĥ«
-0.14
енз
-0.14
vore
-0.14
tega
-0.14
ponder
-0.14
loub
-0.14
avana
-0.14
lis
-0.13
POSITIVE LOGITS
ously
0.21
courage
0.18
bold
0.17
-bold
0.16
ous
0.16
æķ¢
0.15
courageous
0.15
bold
0.15
fires
0.14
lessly
0.14
Activations Density 0.031%