INDEX
Explanations
references to online communities or specific online platforms
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1081
+0.17
0.5%
1978
+0.10
0.3%
321
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1081
+0.17
0.08
1062
+0.10
0.07
1705
+0.08
0.06
Negative Logits
unlaw
-0.94
gaily
-0.90
intersper
-0.89
fortn
-0.88
reconno
-0.88
increa
-0.86
affor
-0.86
apprehen
-0.85
purcha
-0.85
unwarran
-0.84
POSITIVE LOGITS
Hauptartikel
0.55
]=>
0.54
setVerticalGroup
0.54
fjspx
0.53
GORITH
0.52
ScopeManager
0.50
Clik
0.50
UObject
0.49
aapt
0.49
IRQn
0.49
Activations Density 0.490%