INDEX
Explanations
terms related to carcinogenicity and cancer-related concepts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.20
1.2%
376
+0.12
0.7%
125
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
10
+0.20
0.09
98
+0.12
0.04
165
+0.10
0.07
Negative Logits
ĥ½
-2.26
ĨĴ
-2.25
į
-2.09
↵
-2.07
-2.07
↵
-2.07
-2.07
-2.07
č↵
-2.07
↵
-2.07
POSITIVE LOGITS
patrick
1.85
"}](#
1.75
sey
1.66
measures
1.59
hold
1.59
ster
1.57
shots
1.53
oked
1.50
calc
1.49
mong
1.48
Activations Density 3.912%