INDEX
Explanations
words related to legislation, government, and regulatory systems
Neuron Alignment
Index    Value    % of L₁
1741     +0.21    0.7%
1385     +0.16    0.5%
1967     +0.14    0.5%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1967     +0.21       0.09
16       +0.16       0.11
189      +0.14       0.05
Negative Logits
alip        -0.58
==="        -0.49
allclose    -0.49
!=-         -0.47
öf          -0.46
logarith    -0.46
Erkrank     -0.46
şekkür      -0.46
:].         -0.46
])*         -0.45
Positive Logits
parteci      0.80
vuol         0.78
dimenti      0.74
fordable     0.74
dicono       0.72
sappi        0.71
milano       0.70
thermomix    0.69
migli        0.69
interessa    0.68
Activations Density: 0.842%