INDEX
Explanations
expressions related to reckless behavior or actions
references to reckless behavior
New Auto-Interp
Negative Logits
ļéĨĴ
-0.88
olithic
-0.85
culosis
-0.77
*/(
-0.76
rose
-0.76
Rite
-0.71
olith
-0.71
berman
-0.71
ophon
-0.70
quart
-0.70
POSITIVE LOGITS
reck
0.89
endanger
0.88
disregard
0.85
lessly
0.81
err
0.79
ACTIONS
0.78
negligence
0.77
careless
0.75
reckless
0.74
respons
0.73
Activations Density 0.036%