INDEX
Explanations
words related to legal procedures and organizational processes
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1482
+0.16
0.6%
687
+0.14
0.5%
144
+0.12
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
144
+0.16
0.03
1482
+0.14
0.03
860
+0.12
0.03
Negative Logits
dikeluarkan
-0.55
mekte
-0.55
ditetapkan
-0.54
dialami
-0.54
Datuak
-0.49
diğinde
-0.49
diadakan
-0.48
ovunque
-0.47
UnitTesting
-0.45
indietro
-0.45
POSITIVE LOGITS
Di
1.06
di
1.01
Di
0.99
DI
0.92
Dib
0.85
di
0.84
diar
0.83
Dip
0.79
Diar
0.77
DI
0.77
Activations Density 0.166%