INDEX
Explanations
mentions of physical states or actions related to loss of consciousness
terms related to unconsciousness and states of incapacitation
Negative Logits
ERO       -0.79
hetti     -0.79
istor     -0.77
risome    -0.73
tein      -0.71
ourke     -0.71
ickr      -0.70
idated    -0.69
adal      -0.67
URI       -0.67
Positive Logits
ness         1.10
nesses       0.88
ly           0.81
unconscious  0.69
lings        0.69
liner        0.66
bystand      0.64
baggage      0.63
butt         0.63
LY           0.63
Activations Density: 0.013%