INDEX
Explanations
locations and descriptions of scenes or objects
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
382
+0.20
0.6%
1535
+0.19
0.6%
2034
+0.16
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
382
+0.20
0.10
1535
+0.19
0.07
140
+0.16
0.07
Negative Logits
hairc
-0.93
Lma
-0.89
Lmfao
-0.89
Darum
-0.87
Fuckin
-0.86
Souha
-0.86
Endlich
-0.86
notor
-0.86
suspic
-0.84
FTFY
-0.82
POSITIVE LOGITS
There
0.70
It
0.69
They
0.68
This
0.64
Then
0.63
The
0.61
These
0.60
An
0.59
Its
0.58
***!
0.58
Activations Density 0.491%