INDEX
Explanations
Information related to proofreading, specifically checking spelling and grammar and following up on errors
Neuron Alignment
Index   Value   % of L₁
381     +0.14   0.4%
876     +0.12   0.4%
1403    +0.12   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
2044    +0.14      0.06
1403    +0.12      0.01
82      +0.12      0.03
Negative Logits
dises    -1.31
hina     -1.16
kac      -1.10
seiz     -1.09
saar     -1.08
antik    -1.07
uhr      -1.07
Cik      -1.07
Juf      -1.06
gubern   -1.06
Positive Logits
tupperware   0.83
easiest      0.72
usually      0.72
solicited    0.71
thinkable    0.67
moż          0.67
swarovski    0.67
wspania      0.67
velour       0.67
Ename        0.66
Activations Density 0.359%