INDEX
Explanations
words related to physical discomfort or challenges
references to physical discomfort or pains
New Auto-Interp
Negative Logits
onz
-0.75
irl
-0.74
riminal
-0.72
VERTISEMENT
-0.66
ramid
-0.66
Couch
-0.64
nom
-0.64
option
-0.64
enz
-0.63
SW
-0.63
POSITIVE LOGITS
pains
1.56
terness
0.99
staking
0.84
pain
0.80
terday
0.79
assail
0.79
indu
0.79
sore
0.78
ridden
0.75
killers
0.75
Activations Density 0.003%