INDEX
Explanations
words related to satisfaction or dissatisfaction
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
376
+0.16
0.9%
111
+0.14
0.8%
380
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
111
+0.16
0.01
380
+0.14
0.01
338
+0.12
0.01
Negative Logits
hip
-1.81
)\]
-1.62
*](#
-1.55
pH
-1.54
FORMATION
-1.45
tered
-1.44
acters
-1.41
alth
-1.40
)](#
-1.38
queous
-1.38
POSITIVE LOGITS
ĻĤ
1.84
institution
1.50
organizations
1.46
organisations
1.44
jurisdictions
1.43
bourg
1.40
Majesty
1.36
awards
1.36
actory
1.36
semb
1.33
Activations Density 0.022%