INDEX
Explanations
phrases related to news events and actions taken
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
2019
+0.12
0.4%
1806
+0.10
0.3%
50
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1806
+0.12
0.06
239
+0.10
0.05
1921
+0.10
0.06
Negative Logits
noël
-0.99
exé
-0.93
soggior
-0.93
broderie
-0.91
poulet
-0.89
dégust
-0.89
prétend
-0.89
répon
-0.85
accompagne
-0.85
tricot
-0.84
POSITIVE LOGITS
vowed
0.72
expects
0.70
believes
0.70
encourages
0.69
wished
0.69
wondered
0.69
acknowledges
0.67
anticipates
0.66
recommends
0.65
considers
0.65
Activations Density 0.339%