INDEX
Explanations
legal terms and components related to court cases
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
410
+0.15
0.9%
21
+0.12
0.7%
104
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
21
+0.15
0.03
168
+0.12
0.03
367
+0.12
0.02
Negative Logits
depends
-1.72
cribe
-1.67
itary
-1.65
varies
-1.60
eless
-1.52
ocracy
-1.47
tons
-1.46
imate
-1.46
avirus
-1.40
Notify
-1.40
POSITIVE LOGITS
Īĺ
3.13
2.86
↵Č
2.86
↵
2.86
↵
2.86
č↵
2.86
2.86
↵
2.86
<|outofrange|>
2.86
č↵
2.86
Activations Density 0.093%