INDEX
Explanations
words related to being unconscious or incapacitated
mentions of states of unconsciousness or loss of consciousness
Negative Logits
ourke     -0.76
ERO       -0.75
20439     -0.74
ATT       -0.74
DN        -0.71
adr       -0.71
raltar    -0.70
UU        -0.68
rer       -0.67
New       -0.66
Positive Logits
unconscious    1.02
exha           0.77
proport        0.76
uce            0.76
induction      0.74
nesses         0.74
pedia          0.73
ness           0.72
toile          0.71
selage         0.71
Activations Density: 0.006%