INDEX
Explanations
phrases related to negative customer experiences, complaints, and issues, as well as phrases related to financial matters and business restructuring
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
690
+0.10
0.3%
1843
+0.10
0.3%
678
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
509
+0.10
0.08
1843
+0.10
0.06
1531
+0.09
0.06
Negative Logits
effe
-0.95
sappi
-0.95
guarante
-0.95
perciò
-0.92
volunte
-0.91
inev
-0.90
embra
-0.88
emphat
-0.86
accla
-0.85
pertanto
-0.82
POSITIVE LOGITS
their
0.87
them
0.77
his
0.77
its
0.76
your
0.69
it
0.67
her
0.67
themselves
0.66
their
0.64
THEIR
0.64
Activations Density 0.555%