INDEX
Explanations
legal terms and court orders
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
365
+0.14
0.8%
182
+0.13
0.7%
111
+0.13
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
111
+0.14
0.10
271
+0.13
0.04
156
+0.13
0.05
Negative Logits
cy
-2.12
pt
-1.52
lived
-1.51
power
-1.41
space
-1.37
apolis
-1.37
open
-1.36
everyday
-1.36
spaces
-1.34
appeared
-1.33
POSITIVE LOGITS
EMENT
2.11
ONG
1.78
ERTY
1.70
INGS
1.70
ERR
1.67
imony
1.66
naire
1.62
erior
1.62
ICE
1.62
IVE
1.61
Activations Density 0.377%