INDEX
Explanations
phrases related to statistics and data analysis
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
478
+0.15
0.7%
50
+0.11
0.5%
405
+0.11
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1490
+0.15
0.05
405
+0.11
0.05
478
+0.11
0.03
Negative Logits
<bos>
-2.52
ⓧ
-0.97
/***
-0.97
endow
-0.92
<?
-0.87
intersper
-0.85
-0.78
rejoin
-0.78
aggravate
-0.73
endeavoured
-0.73
POSITIVE LOGITS
paradiso
1.08
chrysler
0.99
tramonto
0.99
broderie
0.98
pection
0.98
pioggia
0.97
lamborghini
0.94
tionally
0.92
eiffel
0.92
requently
0.91
Activations Density 0.372%