INDEX
Explanations
sentences related to adding content and interacting with online platforms
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.18
0.6%
1729
+0.06
0.2%
1224
+0.06
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1802
+0.18
0.04
924
+0.06
0.05
1729
+0.06
0.04
Negative Logits
<bos>
-2.49
BufferException
-0.89
CPtr
-0.75
Wikimedijinoj
-0.74
illots
-0.73
endwhile
-0.73
CreateIndex
-0.73
runApp
-0.72
HasColumnType
-0.72
UnitTesting
-0.72
POSITIVE LOGITS
maneu
2.59
reluct
2.52
impra
2.51
affor
2.45
increa
2.45
encomp
2.43
shenan
2.41
disagre
2.38
inev
2.37
accla
2.36
Activations Density 0.444%