INDEX
Explanations
phrases related to treatment and effects of medication on health
Negative Logits
GRAM      -0.18
kening    -0.15
snap      -0.15
tres      -0.15
amble     -0.15
snap      -0.15
isto      -0.14
Snap      -0.14
agne      -0.14
ubbles    -0.14
Positive Logits
vom           0.51
vomiting      0.48
nausea        0.37
projectile    0.33
nause         0.33
sickness      0.32
Projectile    0.32
gag           0.31
von           0.30
sick          0.30
Activations Density 0.054%