INDEX
Explanations
job titles and roles in employment contexts
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.32
1.2%
1013
+0.14
0.5%
227
+0.14
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
227
+0.32
0.07
1013
+0.14
0.07
1980
+0.14
0.04
Negative Logits
<bos>
-1.53
//---
-0.56
lengthen
-0.56
//<
-0.55
//...
-0.55
ૌ
-0.54
contend
-0.54
transcend
-0.50
//--
-0.50
⇔
-0.49
POSITIVE LOGITS
Juf
0.96
zove
0.96
Bekasi
0.96
Palembang
0.96
rafra
0.93
swarovski
0.92
Minang
0.91
montagna
0.91
Italij
0.89
ducato
0.88
Activations Density 0.552%