Explanations: words related to writing or inscriptions
Neuron Alignment
Index   Value   % of L₁
50      +0.24   0.9%
1150    +0.15   0.6%
1839    +0.07   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1150    +0.24      0.04
957     +0.15      0.04
1389    +0.07      0.04
Negative Logits
Token           Logit
<bos>           -2.75
ⓧ               -1.20
/**             -1.14
<?              -1.10
(blank)         -1.02
/*              -0.90
/***            -0.83
/*++            -0.77
<?              -0.68
Transcripción   -0.62
Positive Logits
Token        Logit
practition   1.09
maneu        1.04
Keny         1.00
Minang       1.00
maroc        1.00
disreg       0.99
despotism    0.98
Juf          0.94
EEU          0.94
shenan       0.93
Activations Density: 1.442%