INDEX
Explanations
occurrences of the verb "examine" and its variations in the context of research or analysis
Neuron Alignment
Index   Value   % of L₁
23      +0.30   1.8%
156     +0.21   1.2%
148     +0.17   1.0%
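The "% of L₁" column in the Neuron Alignment table appears to give each value's share of the L₁ norm (sum of absolute values) of the full alignment vector. The listed rows are roughly consistent with that reading, since 0.30 / 1.8%, 0.21 / 1.2% and 0.17 / 1.0% all imply a norm near 17. The sketch below assumes that interpretation. The function name `l1_share` and the toy vector are hypothetical and are not taken from the dashboard or the feature's real weights.

```python
import numpy as np

def l1_share(weights, top_k=3):
    """Return (index, value, share of L1 norm) for the top_k entries by |value|."""
    weights = np.asarray(weights, dtype=float)
    l1 = np.abs(weights).sum()                      # L1 norm of the whole vector
    order = np.argsort(-np.abs(weights))[:top_k]    # largest magnitudes first
    return [(int(i), float(weights[i]), float(abs(weights[i]) / l1)) for i in order]

# Hypothetical toy vector for illustration only.
toy = np.array([0.5, -0.25, 0.125, 0.125])
rows = l1_share(toy, top_k=2)
for index, value, share in rows:
    print(f"{index}\t{value:+.2f}\t{share:.1%}")
```

Under this reading, small percentages such as 1.8% indicate that the feature's weight is spread across many neurons rather than concentrated in the few listed.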
Correlated Neurons
Index   P. Corr.   Cos Sim.
148     +0.30      0.02
115     +0.21      0.01
45      +0.17      0.01
Negative Logits
IJ   -2.27
ŀ    -2.09
Ħ    -2.01
Ľ    -1.98
Ĵ    -1.95
§    -1.77
ĸ    -1.76
Ģ    -1.74
ĭ    -1.71
ī    -1.68
Positive Logits
them         1.93
iqu          1.74
them         1.62
itely        1.54
ifferences   1.46
the          1.45
ivities      1.44
findings     1.44
how          1.43
idine        1.43
Activations Density 0.076%