INDEX
Explanations
phrases indicating the passage of time or progression
Neuron Alignment
Index   Value   % of L₁
1103    +0.14   0.4%
1272    +0.10   0.3%
856     +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1103    +0.14      0.05
728     +0.10      0.03
134     +0.09      0.03
Negative Logits
heapq         -0.81
cześ          -0.71
pymysql       -0.61
embodi        -0.59
psycopg       -0.58
;;)           -0.58
smtplib       -0.58
xffffffff     -0.52
Ottobre       -0.52
gettyimages   -0.50
Positive Logits
July        0.69
December    0.69
June        0.67
February    0.66
by          0.64
November    0.64
August      0.63
January     0.63
April       0.62
September   0.61
Activations Density 0.148%