INDEX
Explanations
phrases related to legal/judicial phrases and procedures
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
481
+0.12
0.4%
324
+0.11
0.4%
1872
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
678
+0.12
0.05
481
+0.11
0.05
1872
+0.10
0.03
Negative Logits
<bos>
-0.96
близь
-0.64
Sa
-0.64
використання
-0.62
Pa
-0.61
умова
-0.60
Alabama
-0.60
Fo
-0.60
Da
-0.60
безпе
-0.59
POSITIVE LOGITS
milano
1.57
napoli
1.56
imposs
1.55
sentra
1.52
abnorm
1.51
bandung
1.48
canel
1.46
indestru
1.45
disreg
1.43
maneu
1.43
Activations Density 0.431%