INDEX
Explanations
numerical data structured in a specific format
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1343
+0.10
0.3%
1978
+0.09
0.2%
1870
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1343
+0.10
0.06
1978
+0.09
0.05
453
+0.08
0.05
Negative Logits
guarante
-1.49
intersper
-1.46
increa
-1.45
impra
-1.40
gaily
-1.37
fta
-1.36
encomp
-1.36
attemp
-1.35
alre
-1.34
disagre
-1.33
POSITIVE LOGITS
itong
0.74
lamang
0.69
silang
0.66
AutoresizingMask
0.64
troviamo
0.61
sappiamo
0.61
maging
0.60
vuol
0.60
xFFFFFF
0.60
ieR
0.58
Activations Density 0.228%