INDEX
Explanations
phrases related to legal and political matters
Neuron Alignment
Index   Value   % of L₁
227     +0.15   0.4%
453     +0.11   0.3%
1600    +0.08   0.2%
Correlated Neurons
Index   P. Corr.   Cos Sim.
227     +0.15      0.05
1600    +0.11      0.04
1957    +0.08      0.02
Negative Logits
impra       -2.45
increa      -2.37
maneu       -2.33
indestru    -2.31
scrat       -2.30
disreg      -2.25
suscep      -2.20
affor       -2.18
inev        -2.13
guarante    -2.13
Positive Logits
<bos>               1.19
vice                0.84
orteur              0.79
deputy              0.79
chief               0.78
senior              0.77
lead                0.77
MigrationBuilder    0.75
queryInterface      0.74
Από                 0.74
Activations Density 0.162%