INDEX
Explanations
the word "Tang" and its variations or associations with "Ming."
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
369
+0.18
1.0%
203
+0.14
0.8%
81
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
332
+0.18
0.01
59
+0.14
0.01
71
+0.12
0.01
Negative Logits
Ļª
-1.61
tracing
-1.60
Ĥ¬
-1.52
marker
-1.50
comple
-1.48
masking
-1.40
correcting
-1.37
covering
-1.36
ção
-1.36
connecting
-1.36
POSITIVE LOGITS
urd
1.99
doms
1.94
ultan
1.80
iei
1.75
ued
1.72
iani
1.72
ief
1.71
ue
1.68
eni
1.67
iev
1.65
Activations Density 0.023%