INDEX
Explanations
news releases and statements in a formal context
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1150
+0.17
0.5%
453
+0.13
0.4%
1343
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
613
+0.17
0.06
1150
+0.13
0.04
859
+0.10
0.04
Negative Logits
swarovski
-1.67
hairc
-1.58
ecru
-1.54
unden
-1.50
increa
-1.49
eiffel
-1.44
fta
-1.43
ftu
-1.43
desir
-1.38
pollut
-1.38
POSITIVE LOGITS
statement
0.90
Statement
0.79
statement
0.78
announcement
0.75
Statement
0.74
spokesperson
0.72
spokesman
0.70
spokeswoman
0.69
issued
0.67
statements
0.65
Activations Density 0.225%