INDEX
Explanations
phrases related to a mix or blend of different elements or characteristics
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1899
+0.10
0.3%
678
+0.09
0.2%
752
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1899
+0.10
0.03
766
+0.09
0.03
814
+0.08
0.01
Negative Logits
McLaugh
-0.75
Rine
-0.71
Gorb
-0.65
apprehen
-0.63
vainly
-0.62
Larg
-0.62
Gies
-0.61
Vaugh
-0.60
Cuth
-0.60
,¹
-0.60
POSITIVE LOGITS
paradiso
0.72
palio
0.69
AssemblyCompany
0.67
affari
0.64
vacanza
0.60
treno
0.60
tibi
0.59
calyx
0.58
soggior
0.57
perdere
0.56
Activations Density 0.163%