INDEX
Explanations
technical terms related to advancements in technology
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
776
+0.11
0.3%
1870
+0.11
0.3%
1385
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
459
+0.11
0.05
563
+0.11
0.03
678
+0.09
0.04
Negative Logits
laft
-0.72
Juf
-0.66
leaft
-0.65
Theile
-0.63
ftre
-0.63
defire
-0.63
fta
-0.62
Bakar
-0.62
feen
-0.61
Rine
-0.61
POSITIVE LOGITS
-
0.94
‑
0.78
‐
0.69
😭😭
0.58
CENTRO
0.58
🤣🤣
0.58
👆
0.58
giù
0.57
ậc
0.56
molta
0.55
Activations Density 0.353%