INDEX
Explanations
words related to physical symptoms like vomiting, coughing, and nausea
references to vomiting and respiratory symptoms
New Auto-Interp
Negative Logits
natureconservancy
-0.75
Else
-0.64
esan
-0.64
Kinnikuman
-0.64
HCR
-0.63
standings
-0.62
adr
-0.62
Goal
-0.61
rosse
-0.61
Reply
-0.60
POSITIVE LOGITS
pants
0.82
vomiting
0.82
inducing
0.81
weed
0.76
vomit
0.72
osis
0.72
ards
0.71
itus
0.70
overboard
0.70
aware
0.67
Activations Density 0.011%