INDEX
Explanations
patterns of willingness and preparedness regarding taking risks or actions in various contexts
New Auto-Interp
Negative Logits
Laden
-0.16
hle
-0.16
solete
-0.15
ICODE
-0.15
uae
-0.15
HING
-0.15
leigh
-0.15
asia
-0.15
ernel
-0.14
agon
-0.14
POSITIVE LOGITS
willing
0.80
willingness
0.69
ready
0.46
readiness
0.42
prepared
0.41
æĦ¿
0.40
unwilling
0.39
Ready
0.38
гоÑĤов
0.37
open
0.36
Activations Density 0.251%