INDEX
Explanations
phrases related to data analysis and statistical information
Neuron Alignment
Index    Value    % of L₁
752      +0.16    0.6%
1602     +0.13    0.5%
757      +0.13    0.5%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1602     +0.16       0.09
757      +0.13       0.09
1942     +0.13       0.08
Negative Logits
<bos>         -0.75
ciclopedia    -0.60
==""){        -0.59
gerekir       -0.58
worin         -0.55
("="          -0.53
оригіналу     -0.53
]>=           -0.53
/**           -0.52
)();          -0.52
Positive Logits
hcm          1.06
stockholm    0.95
contex       0.94
lein         0.91
sofia        0.90
loren        0.90
lyon         0.88
suscep       0.86
lara         0.86
Juf          0.86
Activations Density 0.382%