INDEX
Explanations
paragraphs related to legal matters and issues
Neuron Alignment
Index   Value   % of L₁
674     +0.08   0.2%
872     +0.08   0.2%
1592    +0.07   0.2%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1781    +0.08      0.06
1081    +0.08      0.06
1372    +0.07      0.05
Negative Logits
nachron     -0.85
ordina      -0.80
postolic    -0.79
madeus      -0.78
ché         -0.77
notori      -0.77
Ordre       -0.76
capulco     -0.75
roba        -0.75
reputa      -0.75
Positive Logits
regarding   0.78
concerning  0.72
Kenmerken   0.66
regarding   0.66
about       0.60
Regarding   0.57
whereby     0.57
Flere       0.56
Przyp       0.54
yaitu       0.54
Activations Density: 0.515%