Explanations
words related to publishing and texts
Neuron Alignment
Index   Value   % of L₁
1317    +0.13   0.5%
1537    +0.09   0.3%
1842    +0.08   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1317    +0.13      0.07
1284    +0.09      0.07
509     +0.08      0.05
Negative Logits
<bos>           -2.46
Autoritní       -1.06
glMatrixMode    -0.91
contentLoaded   -0.89
脚注の使い方    -0.87
glPushMatrix    -0.86
/**             -0.84
}{||            -0.82
uxxxx           -0.82
LookAnd         -0.81
Positive Logits
shenan    2.54
maneu     2.51
affor     2.40
hentai    2.39
snoopy    2.30
milf      2.30
scrat     2.26
impra     2.24
wikihow   2.22
Juf       2.19
Activations Density 0.409%