INDEX
Explanations
text related to technology and information sharing
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1150
+0.13
0.4%
906
+0.12
0.4%
1445
+0.11
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
924
+0.13
0.04
1968
+0.12
0.03
1445
+0.11
0.05
Negative Logits
Và
-0.65
Còn
-0.64
Parabéns
-0.62
Quiénes
-0.61
Conclusiones
-0.60
Daarna
-0.59
Preparación
-0.58
Nhưng
-0.58
Nascimento
-0.57
==""){-0.57
POSITIVE LOGITS
https
1.09
intermitt
1.08
guarante
1.06
http
1.03
gild
1.03
https
1.00
affor
0.99
scrat
0.98
erad
0.98
arn
0.97
Activations Density 0.215%