INDEX
Explanations
language related to requirements or needs
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
241
+0.10
0.3%
1053
+0.10
0.3%
1339
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
241
+0.10
0.03
1103
+0.10
0.03
1140
+0.10
0.03
Negative Logits
désigne
-0.70
prouve
-0.56
ferait
-0.52
apparaît
-0.52
trouvera
-0.51
protège
-0.49
reconnaît
-0.49
résulte
-0.48
printStats
-0.48
Truthy
-0.48
POSITIVE LOGITS
require
0.84
requiring
0.81
require
0.80
Requires
0.79
Require
0.77
requires
0.76
Require
0.71
requires
0.68
REQUIRE
0.67
requireNonNull
0.66
Activations Density 0.069%