Explanations
phrases related to loss or danger to human life
references to human lives and their significance in various contexts, particularly those involving risk or loss
Negative Logits
Token         Logit
CAST          -0.70
ane           -0.70
NetMessage    -0.64
atorial       -0.62
Marginal      -0.62
ripp          -0.60
ractive       -0.60
ggles         -0.59
IDE           -0.58
iasis         -0.58
Positive Logits
Token         Logit
lives         0.86
chool         0.84
journal       0.83
lihood        0.82
sole          0.81
©¶æ           0.77
Forever       0.76
ynthesis      0.74
pun           0.73
bage          0.72
Activations Density 0.014%