INDEX
Explanations
criteria for non-discrimination and equal opportunities based on traits like race, gender, sexual orientation, and disability
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.19
1.1%
1870
+0.11
0.6%
1044
+0.09
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
437
+0.19
0.02
1077
+0.11
0.02
765
+0.09
0.02
Negative Logits
<bos>
-3.40
ⓧ
-1.06
-0.93
/**
-0.90
<?
-0.87
/*
-0.83
/*++
-0.77
<?
-0.77
find
-0.71
<!--
-0.70
POSITIVE LOGITS
Juf
1.70
Minang
1.67
stockholm
1.61
aen
1.60
bandung
1.59
dises
1.58
riviera
1.53
thut
1.53
Augu
1.53
maer
1.53
Activations Density 0.062%