INDEX
Explanations
phrases related to comparing and contrasting different elements
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1387
+0.13
0.4%
1325
+0.12
0.4%
1981
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1981
+0.13
0.03
223
+0.12
0.03
1387
+0.12
0.03
Negative Logits
FlatAppearance
-0.53
lcccccc
-0.49
Leith
-0.47
Estrella
-0.47
Ké
-0.46
Wul
-0.46
Sosa
-0.45
Wyn
-0.45
Vann
-0.44
dikt
-0.43
POSITIVE LOGITS
comparisons
1.08
comparing
1.08
comparison
1.08
compare
1.08
compares
1.05
comparaison
0.98
Comparison
0.97
Compare
0.96
Compare
0.96
Comparisons
0.95
Activations Density 0.078%