INDEX
Explanations
information related to viruses and medical research involving viruses
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
537
+0.08
0.3%
1092
+0.08
0.3%
517
+0.08
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
537
+0.08
0.03
517
+0.08
0.02
144
+0.08
0.03
Negative Logits
<bos>
-1.10
落
-0.73
,
-0.68
大
-0.68
立
-0.68
เล
-0.68
補
-0.67
出
-0.67
周
-0.67
can
-0.66
POSITIVE LOGITS
increa
2.21
affor
2.15
maneu
2.15
accla
2.13
effe
2.04
Virus
2.03
guarante
2.01
emphat
2.01
ftu
1.98
fath
1.98
Activations Density 0.171%