INDEX
Explanations
quotes from spokespersons or spokespeople
Neuron Alignment
Index    Value    % of L₁
1150     +0.17    0.5%
1343     +0.12    0.4%
453      +0.12    0.4%
Correlated Neurons
Index    P. Corr.    Cos Sim.
888      +0.17       0.07
1150     +0.12       0.04
1650     +0.12       0.05
Negative Logits
swarovski    -1.41
murano       -1.40
bordeaux     -1.37
vespa        -1.35
eiffel       -1.32
lidl         -1.31
ecru         -1.30
thermomix    -1.29
ibiza        -1.26
cabrio       -1.22
Positive Logits
said             0.67
confirmed        0.66
explained        0.63
told             0.63
Datuak           0.62
ագրություններ    0.59
stated           0.59
estekak          0.57
confirm          0.55
Παραπομπές       0.55
Activations Density: 0.318%