INDEX
Explanations
phrases related to data analysis and statistics
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1870
+0.14
0.4%
1839
+0.09
0.3%
1415
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1870
+0.14
0.03
563
+0.09
0.03
245
+0.08
0.02
Negative Logits
tuu
-0.55
ikiki
-0.55
esternos
-0.53
Meksiku
-0.53
Pines
-0.50
uuuuu
-0.50
Datuak
-0.49
qas
-0.49
Briefs
-0.48
архивлан
-0.47
POSITIVE LOGITS
greja
0.73
Qualquer
0.71
Cringe
0.69
Noice
0.68
Lmfao
0.68
ajuns
0.68
deschis
0.68
Portanto
0.67
Ikr
0.66
FTFY
0.66
Activations Density 0.320%