INDEX
Explanations
legal terms and questions related to legal procedures
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1757
+0.17
0.8%
341
+0.13
0.6%
1127
+0.12
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1363
+0.17
0.03
1056
+0.13
0.03
376
+0.12
0.02
Negative Logits
<bos>
-2.06
aproveitar
-0.58
alcançar
-0.57
Villa
-0.56
MenuInflater
-0.55
Villa
-0.55
Tall
-0.55
Мексика
-0.55
aceitar
-0.54
addComponent
-0.53
POSITIVE LOGITS
judgment
1.40
Judgment
1.24
judgement
1.21
judgment
1.18
blackpink
1.09
Judgment
1.09
judgments
1.08
uniqlo
1.07
balenciaga
1.06
chèvre
1.05
Activations Density 0.310%