INDEX
Explanations
numbers or measurements in a document
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.23
0.9%
1967
+0.10
0.4%
1515
+0.10
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1317
+0.23
0.05
1892
+0.10
0.04
446
+0.10
0.03
Negative Logits
<bos>
-3.12
ⓧ
-0.86
<?
-0.83
-0.82
/***
-0.72
/**
-0.68
AssemblyCompany
-0.56
gynnwys
-0.56
💼
-0.55
Transcripción
-0.52
POSITIVE LOGITS
maroc
1.06
vinci
0.97
ados
0.96
brava
0.92
nomine
0.91
ananas
0.90
broderie
0.88
comesti
0.87
frambo
0.87
cioc
0.86
Activations Density 0.297%