INDEX
Explanations
phrases related to technology and software updates
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
597
+0.14
0.5%
1637
+0.12
0.4%
1480
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
597
+0.14
0.02
1480
+0.12
0.02
1637
+0.11
0.02
Negative Logits
ophan
-0.59
Ainda
-0.59
Mesmo
-0.56
aen
-0.55
Personensuche
-0.54
Assista
-0.53
Ainda
-0.52
Conclusão
-0.51
parecia
-0.49
Conteúdo
-0.48
POSITIVE LOGITS
OS
1.11
OS
0.94
Os
0.91
os
0.87
Osborne
0.83
Os
0.83
os
0.72
Osbourne
0.69
Operating
0.68
operating
0.65
Activations Density 0.056%