INDEX
Explanations
references to mobile phones
Neuron Alignment
Index   Value   % of L₁
156     +0.18   1.0%
23      +0.14   0.8%
495     +0.12   0.7%
Correlated Neurons
Index   P. Corr.   Cos Sim.
311     +0.18      0.03
251     +0.14      0.02
276     +0.12      0.02
Negative Logits
OUGH      -1.51
=\"       -1.50
»¿        -1.49
etc       -1.49
aucoup    -1.43
etc       -1.41
¥         -1.40
umen      -1.40
ilogy     -1.39
contra    -1.38
POSITIVE LOGITS
rophic    1.56
dock      1.54
Commun    1.52
lanes     1.51
oretic    1.47
fraction  1.45
steady    1.44
center    1.41
spots     1.40
honors    1.40
Activations Density 0.282%