INDEX
Explanations
contact information and social media handles
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
814
+0.10
0.3%
227
+0.10
0.3%
1445
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
981
+0.10
0.03
406
+0.10
0.02
1980
+0.09
0.02
Negative Logits
}{*}{}-0.56
RemoteException
-0.55
니메이션
-0.51
}\}$
-0.49
phosphates
-0.48
kurzem
-0.48
jennifer
-0.48
personal
-0.47
URLException
-0.46
fernando
-0.46
POSITIVE LOGITS
-@
0.93
Jä
0.86
.@
0.85
solidar
0.85
[@
0.79
restre
0.75
alkoh
0.75
rafra
0.75
accla
0.74
minimalis
0.74
Activations Density 0.044%