INDEX
Explanations
mentions of the term "Jefferson"
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
486
+0.16
0.7%
1276
+0.14
0.6%
1034
+0.13
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
227
+0.16
0.04
1177
+0.14
0.02
699
+0.13
0.03
Negative Logits
ceptre
-0.75
liberality
-0.66
Vaugh
-0.66
shewn
-0.66
otheby
-0.64
Shaksp
-0.62
ingrat
-0.59
kaitan
-0.59
leece
-0.58
ungguh
-0.57
POSITIVE LOGITS
Jefferson
1.23
Jefferson
1.12
Madison
0.80
Alexander
0.74
Madison
0.67
Ké
0.64
Alexander
0.62
ERSON
0.58
Câ
0.56
jeff
0.56
Activations Density 0.236%