INDEX
Explanations
phrases related to listening, suggestions, and conversations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
517
+0.11
0.4%
303
+0.11
0.4%
130
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
376
+0.11
0.03
130
+0.11
0.03
791
+0.11
0.02
Negative Logits
kram
-0.71
alkoh
-0.68
kosme
-0.65
akut
-0.63
moza
-0.63
stoff
-0.63
krim
-0.63
sement
-0.61
Singapur
-0.61
Okt
-0.61
POSITIVE LOGITS
listen
1.28
listening
1.22
listened
1.21
Listen
1.18
listens
1.15
listening
1.13
listen
1.12
LISTEN
1.11
Listening
1.10
Listening
1.08
Activations Density 0.054%