INDEX
Explanations
references to the term "current" in various contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.30
1.7%
2011
+0.11
0.6%
479
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
479
+0.30
0.04
1942
+0.11
0.03
200
+0.10
0.03
Negative Logits
<bos>
-2.99
ⓧ
-0.62
///**
-0.61
//{-0.59
Havolalar
-0.59
namespace
-0.59
adopt
-0.59
define
-0.58
activate
-0.58
Ligações
-0.57
POSITIVE LOGITS
affor
1.70
Juf
1.56
accla
1.54
fta
1.51
maneu
1.49
reluct
1.46
Intere
1.46
aen
1.46
ftu
1.45
milf
1.44
Activations Density 0.040%