INDEX
Explanations
detailed descriptions related to historical events or figures
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
184
+0.24
1.0%
1577
+0.22
0.9%
50
+0.19
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
184
+0.24
0.04
394
+0.22
0.06
1577
+0.19
0.09
Negative Logits
<bos>
-1.10
betweenstory
-0.87
AndroidJUnit
-0.56
Wikimedijinoj
-0.55
Italijani
-0.55
abestanden
-0.53
脚注の使い方
-0.52
AutoScaleMode
-0.51
WindowConstants
-0.48
انجليز
-0.47
POSITIVE LOGITS
heapq
0.69
YMMV
0.53
Bereits
0.47
Wię
0.47
Cześć
0.43
hashlib
0.43
Hahah
0.43
ἀ
0.42
Contactez
0.42
Kedves
0.42
Activations Density 1.565%