INDEX
Explanations
references to colon cancer and related health issues
New Auto-Interp
Negative Logits
èĥ¸
-0.15
breast
-0.15
turnstile
-0.14
orra
-0.14
æ²¢
-0.14
aub
-0.14
_scal
-0.14
ĵ
-0.14
chest
-0.14
csi
-0.14
POSITIVE LOGITS
Colon
0.44
colon
0.43
col
0.40
Colon
0.38
colore
0.34
colon
0.33
Col
0.33
rect
0.32
stool
0.31
sigmoid
0.31
Activations Density 0.040%