INDEX
Explanations
terms related to illness or being unwell
Negative Logits
ized      -0.19
izable    -0.19
egers     -0.16
ize       -0.16
adera     -0.15
phy       -0.15
IZED      -0.15
izer      -0.15
uren      -0.14
thritis   -0.14
Positive Logits
ening     0.21
lesh      0.18
Lomb      0.16
sick      0.15
esse      0.15
ENDOR     0.15
먹        0.15
cratch    0.15
'n        0.15
ussion    0.15
Activations Density: 0.021%