INDEX
Explanations
phrases related to administrative processes or procedures
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.13
0.5%
2034
+0.05
0.2%
1385
+0.05
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1385
+0.13
0.13
491
+0.05
0.09
1288
+0.05
0.09
Negative Logits
<bos>
-1.99
/**
-1.28
/***
-1.27
ⓧ
-1.21
-1.16
<?
-1.15
<?
-1.11
///**
-1.03
/*
-0.92
//*/
-0.90
POSITIVE LOGITS
kaos
1.00
seksi
0.94
keramik
0.86
kafe
0.85
silikon
0.84
lele
0.84
mikrofon
0.83
balon
0.82
panik
0.81
optik
0.79
Activations Density 1.271%