INDEX
Explanations
words related to risk or risky actions
terms related to risk
New Auto-Interp
Negative Logits
upon
-0.91
Nap
-0.81
innie
-0.78
ann
-0.78
oin
-0.76
ental
-0.75
bert
-0.75
through
-0.74
elf
-0.74
onne
-0.73
POSITIVE LOGITS
risky
1.19
gamble
1.07
gamb
0.97
bets
0.89
risks
0.84
risk
0.83
proposition
0.82
sounding
0.78
undermin
0.76
maneuvers
0.74
Activations Density 0.011%