INDEX
Explanations
phrases related to technology and business operations
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1287
+0.08
0.2%
402
+0.07
0.2%
1625
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1287
+0.08
0.04
37
+0.07
0.03
1625
+0.07
0.03
Negative Logits
disagre
-1.18
hairc
-1.00
shenan
-0.92
unspeak
-0.91
intersper
-0.90
unwarran
-0.83
reluct
-0.83
caprice
-0.82
pamph
-0.82
Ename
-0.81
POSITIVE LOGITS
<bos>
0.68
bewerken
0.68
Chham
0.65
Савезне
0.64
Aholisi
0.59
progresses
0.59
Baillargeon
0.58
Flur
0.56
increasingly
0.56
препратки
0.56
Activations Density 0.282%