Explanations
the term "data" in various contexts, particularly in legal and technical references related to liabilities and consequences
Neuron Alignment
Index    Value    % of L₁
369      +0.13    0.8%
359      +0.12    0.7%
494      +0.11    0.7%
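The page does not define the "% of L₁" column. One plausible reading, stated here as an assumption, is that each value is a neuron's weight in the feature's direction vector and the percentage is that weight's absolute value divided by the L₁ norm of the whole vector. The sketch below illustrates that reading on a made-up five-element vector; `l1_share` and `top_aligned` are hypothetical helper names, not part of any dashboard API.

```python
def l1_share(weights, index):
    """Return |weights[index]| as a percentage of the L1 norm of weights."""
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; the share is undefined")
    return 100.0 * abs(weights[index]) / l1_norm


def top_aligned(weights, k=3):
    """Return (index, value, percent of L1) for the k largest values."""
    order = sorted(range(len(weights)), key=lambda i: weights[i], reverse=True)
    return [(i, weights[i], l1_share(weights, i)) for i in order[:k]]


# Toy vector (assumed, not taken from the page); its L1 norm is exactly 1.0.
toy = [0.5, -0.25, 0.125, 0.0625, 0.0625]
rows = top_aligned(toy, k=2)
print(rows)
```

Under this reading, the values listed above (for example 0.13 at 0.8%) would imply an L₁ norm of roughly 16, although the rounding in both columns makes that only a rough estimate.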
Correlated Neurons
Index    P. Corr.    Cos Sim.
320      +0.13       0.01
359      +0.12       0.01
146      +0.11       0.01
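"P. Corr." and "Cos Sim." are presumably Pearson correlation and cosine similarity; the page does not say which vectors each is computed over, so the inputs below are assumptions. The sketch shows the two formulas side by side and why they can disagree: Pearson correlation subtracts the mean first, cosine similarity does not.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def cosine(u, v):
    """Cosine similarity of two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Toy inputs (assumed): ys is xs shifted by a constant.
xs = [1.0, 2.0, 3.0]
ys = [11.0, 12.0, 13.0]
corr = pearson(xs, ys)   # a constant shift leaves the correlation at 1
sim = cosine(xs, ys)     # below 1, because the shift changes the direction
```

A low value in both columns, as in the rows above, would mean the feature is only weakly related to any single neuron under either measure.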
Negative Logits
our            -1.71
Our            -1.68
Connecticut    -1.62
Wyoming        -1.53
arsh           -1.53
]              -1.50
cluding        -1.49
astern         -1.48
vereign        -1.47
us             -1.42
Positive Logits
CRIPT          1.80
ço             1.73
iliary         1.68
ño             1.66
imo            1.57
esan           1.56
rooms          1.56
CRIPTION       1.55
Rptr           1.55
ucci           1.55
Activations Density: 0.022%
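Activation density is not defined on the page. A likely reading, taken as an assumption here, is the percentage of sampled tokens on which the feature's activation is above zero. At 0.022%, that would be about 22 tokens in every 100,000. A minimal sketch with a hypothetical `activation_density` helper:

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of activations strictly above threshold."""
    if not activations:
        raise ValueError("no activations supplied")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Toy sample (assumed): one active token out of four.
sample = [0.0, 0.0, 1.5, 0.0]
density = activation_density(sample)
```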