INDEX
Explanations
content related to products and giveaways
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
297
+0.09
0.3%
1177
+0.09
0.3%
1556
+0.09
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1782
+0.09
0.06
1556
+0.09
0.05
647
+0.09
0.03
Negative Logits
impon
-0.82
utop
-0.74
häm
-0.73
magazin
-0.69
makro
-0.68
geolog
-0.68
pól
-0.67
„,
-0.65
ordina
-0.65
doveva
-0.64
POSITIVE LOGITS
apiece
0.69
totaling
0.64
warran
0.58
namely
0.57
totalling
0.57
demurrer
0.56
oltán
0.54
worth
0.53
declaratory
0.52
krish
0.51
Activations Density 0.339%