INDEX
Explanations
phrases related to technical instructions and procedures
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.35
1.8%
1967
+0.10
0.5%
752
+0.09
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
752
+0.35
0.09
1334
+0.10
0.07
1034
+0.09
0.07
Negative Logits
<bos>
-3.31
ⓧ
-1.36
/**
-1.30
<?
-1.28
-1.03
/*
-0.96
<?
-0.92
/*++
-0.81
/***
-0.81
/*!
-0.69
POSITIVE LOGITS
lele
1.18
wien
1.13
saar
1.09
opport
1.04
vne
1.04
accla
1.02
ibiza
1.02
bandung
1.02
maneu
1.01
Juf
1.01
Activations Density 0.822%