INDEX
Explanations
instances of fear and courage in challenging situations
Negative Logits
aldi      -0.17
orgia     -0.15
忙        -0.15
hurst     -0.15
nip       -0.14
зд        -0.14
azor      -0.14
zano      -0.14
_Debug    -0.14
nier      -0.14
Positive Logits
courage       0.64
bravery       0.57
bold          0.55
courageous    0.53
brave         0.52
勇            0.52
Courage       0.51
daring        0.50
bold          0.50
risk          0.49
Activations Density 0.473%
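
The density figure above is most naturally read as the share of sampled token positions on which the feature fires at all. The sketch below shows that arithmetic under stated assumptions: the function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical, and the dashboard's actual sampling and thresholding are not given in the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above `threshold`.

    Assumption: "density" means the fraction of sampled token positions
    where the feature is active; the source does not define it.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical toy data: 1000 token positions, 5 of which fire.
toy = [0.0] * 995 + [0.3, 1.2, 0.7, 2.1, 0.05]
density = activation_density(toy)
print(f"{density:.3f}%")  # prints "0.500%"
```

On this reading, a density of 0.473% would correspond to roughly 47 active positions per 10,000 sampled tokens.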