INDEX
Explanations
terms related to medical treatments and experimental conditions
references to placebo effects in medical contexts
New Auto-Interp
Negative Logits
hani
-0.73
orian
-0.71
dar
-0.70
laws
-0.68
Allen
-0.67
lar
-0.67
odes
-0.66
creditors
-0.65
ORN
-0.64
don
-0.64
POSITIVE LOGITS
placebo
1.11
analges
0.84
veyard
0.84
nesday
0.72
acupuncture
0.71
aspirin
0.69
lette
0.68
mosqu
0.68
aneously
0.67
augment
0.66
Activations Density 0.012%