INDEX
Explanations
words related to medical conditions characterized as not severe
instances of the word "mild" and its variations
New Auto-Interp
Negative Logits
andise
-0.83
Store
-0.70
Yards
-0.70
atography
-0.67
miah
-0.66
ilater
-0.66
berus
-0.64
Markets
-0.63
Dame
-0.63
ynthesis
-0.62
POSITIVE LOGITS
er
1.10
est
1.08
ew
0.98
erate
0.94
ers
0.91
ening
0.88
ener
0.84
ened
0.84
annoyance
0.81
ering
0.79
Activations Density 0.007%