INDEX
Explanations
specific names, particularly authors and locations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.19
0.9%
1343
+0.16
0.7%
227
+0.08
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1097
+0.19
0.19
1343
+0.16
0.16
2044
+0.08
0.15
Negative Logits
<bos>
-2.22
addComponent
-0.68
onCancelled
-0.68
imanapun
-0.67
новништво
-0.65
AddTagHelper
-0.65
getSystemService
-0.65
onResponse
-0.64
enumerate
-0.63
Nhưng
-0.63
POSITIVE LOGITS
accla
1.62
affor
1.57
maneu
1.55
shenan
1.48
Wtf
1.42
impra
1.41
sappi
1.37
véhic
1.36
scrat
1.35
increa
1.35
Activations Density 1.816%