INDEX
Explanations
complex sentences or passages with a formal tone
Neuron Alignment
Index   Value   % of L₁
814     +0.14   0.5%
1482    +0.10   0.3%
680     +0.10   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
814     +0.14      0.03
680     +0.10      0.02
645     +0.10      0.02
Negative Logits
Himo               -0.54
equila             -0.50
rália              -0.48
Ant                -0.48
Normdatei          -0.47
DMETHOD            -0.47
webElement         -0.46
Ka                 -0.45
IntoConstraints    -0.45
ModelRenderer      -0.45
Positive Logits
disreg             1.00
unspeak            0.85
jorge              0.82
shenan             0.82
invin              0.82
rodriguez          0.81
affitto            0.81
excru              0.79
perfet             0.78
ricardo            0.78
Activations Density: 0.081%