INDEX
Explanations
diagnosis detection
This neuron activates on words related to early cancer detection and treatment.
Negative Logits
گو
-0.07
([{
-0.06
.lastIndexOf
-0.06
γο
-0.06
Islanders
-0.06
/****************************************************************
-0.06
microscopic
-0.05
hoş
-0.05
inh
-0.05
ois
-0.05
Positive Logits
')));↵↵
0.07
)"↵↵
0.07
*/↵↵
0.07
丝
0.06
arrested
0.06
]);↵↵
0.06
Points
0.06
sahuje
0.06
")↵↵
0.06
vitamins
0.06
Activations Density 0.010%