INDEX
Explanations
legal terms and information related to data protection and privacy
Neuron Alignment
Index   Value   % of L₁
453     +0.13   0.4%
906     +0.08   0.2%
2030    +0.08   0.2%
Correlated Neurons
Index   P. Corr.   Cos Sim.
453     +0.13      0.04
432     +0.08      0.01
1098    +0.08      0.02
Negative Logits
Numerade          -0.61
хьтан             -0.60
Wikiseite         -0.60
desertcart        -0.56
kasarigan         -0.55
contentLoaded     -0.55
ArrowToggle       -0.54
disambiguazione   -0.53
OCCURRED          -0.53
Vidite            -0.53
Positive Logits
dises     1.05
wien      0.95
fluo      0.95
inder     0.95
abnorm    0.92
embodi    0.92
ardu      0.92
bett      0.92
desir     0.92
oner      0.91
Activations Density: 0.227%