INDEX
Explanations
inflammatory bowel disease or arthritis
New Auto-Interp
Negative Logits
Hygiene
0.42
स्टम
0.40
atherm
0.39
funniest
0.38
exp
0.38
Cla
0.37
мело
0.37
恹
0.37
ymoon
0.37
बनाता
0.37
POSITIVE LOGITS
inflam
0.96
inflammation
0.90
inflammation
0.90
Infl
0.86
Infl
0.85
infl
0.85
inflamm
0.83
inflamed
0.80
Inflammation
0.79
inflammatory
0.79
Activations Density 0.008%