INDEX
Explanations
words related to pain relief or medical treatment
conjunctions and transitions in discourse
Negative Logits
TW            -0.69
quet          -0.67
abit          -0.65
Correction    -0.65
Person        -0.65
someone       -0.65
Category      -0.64
iably         -0.63
a             -0.63
Either        -0.63
Positive Logits
cumbers       0.82
penchant      0.81
complexities  0.78
the           0.77
consequ       0.77
antics        0.76
sheer         0.76
intric        0.75
resultant     0.74
unbeliev      0.74
Activations Density: 0.248%
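The page does not say how the density figure is computed. A common reading, assumed here, is the fraction of sampled token positions on which the feature has a nonzero activation; under that reading 0.248% would mean the feature fires on roughly one token in 400. The function name `activation_density`, the `threshold` parameter, and the sample data below are all hypothetical and only illustrate that assumed definition.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of token positions where the feature fires.

    Assumption: "fires" means activation > threshold (0.0 by default).
    This is an illustrative definition, not one stated by the source page.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1,000 token positions, feature nonzero on 3 of them.
sample = [0.0] * 997 + [0.4, 1.2, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.300%
```

Raising `threshold` above zero would count only stronger activations, which lowers the reported density for the same data.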