INDEX
Explanations
interactions and actions in specific contexts or scenarios, particularly involving public or legal matters
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
17
+0.22
1.3%
436
+0.17
1.0%
12
+0.15
0.9%
Correlated Neurons
Index
P. Corr.
Cos Sim.
373
+0.22
0.07
436
+0.17
0.06
126
+0.15
0.04
Negative Logits
»¿
-2.41
Ŀ
-2.26
ŀ
-2.23
§
-2.12
¢
-2.09
º
-2.03
ľ
-2.01
¸
-1.99
¿½
-1.98
ª
-1.97
POSITIVE LOGITS
reon
1.63
rails
1.55
InstanceState
1.44
ue
1.42
contracts
1.42
amacare
1.37
Charg
1.36
ua
1.36
imetry
1.33
walls
1.30
Activations Density 1.198%