INDEX
Explanations
phrases and terms related to mental health and behavioral changes
Neuron Alignment
Index   Value   % of L₁
50      +0.38   1.3%
604     +0.09   0.3%
906     +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
595     +0.38      0.05
1553    +0.09      0.07
736     +0.09      0.07
Negative Logits
<bos>         -1.76
ClientSize    -0.56
Referències   -0.56
evangé        -0.55
respecte      -0.52
persa         -0.50
pous          -0.50
profili       -0.49
//            -0.49
public        -0.48
Positive Logits
pymysql    1.39
smtplib    1.22
heapq      1.13
pylab      1.10
hashlib    1.08
psycopg    1.07
dott       1.02
Jambi      1.01
pymongo    0.96
scatt      0.93
Activations Density 1.141%