INDEX
Explanations
the word "base" and its context in discussions of foundational elements or reference points
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
445
+0.14
0.8%
144
+0.14
0.8%
41
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
463
+0.14
0.03
336
+0.14
0.02
487
+0.12
0.02
Negative Logits
oyer
-1.85
INGTON
-1.63
dated
-1.52
){#-1.50
nick
-1.49
burg
-1.47
ements
-1.46
bos
-1.46
OF
-1.40
OLOG
-1.39
POSITIVE LOGITS
¸
1.84
¼
1.72
µ
1.56
Ĥ
1.52
continuously
1.46
prevent
1.44
Ãł
1.43
CDATA
1.41
apa
1.39
cé
1.38
Activations Density 0.011%