INDEX
Explanations
phrases that suggest it is looking at reviews or descriptions of various products or services
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1124
+0.14
0.9%
1974
+0.14
0.8%
1059
+0.14
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1059
+0.14
0.04
1974
+0.14
0.03
1194
+0.14
0.03
Negative Logits
<bos>
-2.22
chränk
-0.61
intensify
-0.60
/**
-0.60
mustered
-0.59
IEnumerator
-0.56
strove
-0.56
consolidate
-0.56
GenerationType
-0.55
materialize
-0.55
POSITIVE LOGITS
Cha
1.60
Cha
1.51
cha
1.30
CHA
1.18
Chá
1.14
Chap
1.09
cha
1.09
Chamb
1.05
Chappell
1.02
CHAP
1.01
Activations Density 0.170%