INDEX
Explanations
updates and announcements
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.20
1.2%
1271
+0.13
0.8%
82
+0.12
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1271
+0.20
0.05
316
+0.13
0.04
82
+0.12
0.04
Negative Logits
<bos>
-3.43
ⓧ
-1.07
<?
-0.91
/***
-0.79
/**
-0.74
/*
-0.74
-0.72
<?
-0.61
/*++
-0.60
Література
-0.59
POSITIVE LOGITS
wien
1.77
lele
1.77
bayern
1.57
meis
1.56
maneu
1.55
dises
1.55
ohr
1.51
bandung
1.49
ananas
1.49
kram
1.48
Activations Density 0.108%