INDEX
Explanations
phrases related to a sequence of actions or steps towards a goal
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
474
+0.12
0.4%
196
+0.12
0.4%
9
+0.10
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
474
+0.12
0.07
9
+0.12
0.05
196
+0.10
0.05
Negative Logits
pymysql
-0.83
heapq
-0.66
psycopg
-0.63
pathlib
-0.53
hashlib
-0.52
Cerca
-0.48
pymongo
-0.48
zipfile
-0.48
smtplib
-0.47
Ngb
-0.47
POSITIVE LOGITS
before
0.84
before
0.84
BEFORE
0.82
venuto
0.74
BEFORE
0.74
Before
0.71
Before
0.68
bago
0.67
sentito
0.65
antes
0.61
Activations Density 0.103%