INDEX
Explanations
instructions or steps to complete a specific task, such as navigating through menus or creating themes
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
251
+0.18
0.6%
674
+0.11
0.4%
1053
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
251
+0.18
0.11
166
+0.11
0.06
1053
+0.10
0.06
Negative Logits
乓
-0.45
FontFamily
-0.43
łaszcza
-0.43
Nhi
-0.42
aquin
-0.40
Dla
-0.40
Responsibility
-0.40
newsletter
-0.39
ítez
-0.38
regno
-0.38
POSITIVE LOGITS
goin
0.88
go
0.86
goTo
0.85
Goes
0.83
went
0.81
GOING
0.81
went
0.80
going
0.80
Went
0.77
goes
0.76
Activations Density 0.162%