INDEX
Explanations
phrases related to concealing identity and leading back to someone
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
946
+0.11
0.3%
198
+0.10
0.3%
876
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
946
+0.11
0.04
1673
+0.10
0.02
1463
+0.08
0.04
Negative Logits
kosme
-0.70
quæ
-0.67
Sén
-0.67
minimalis
-0.67
akut
-0.65
panik
-0.63
konserv
-0.63
kapital
-0.63
Cfr
-0.62
antik
-0.62
POSITIVE LOGITS
identification
0.76
identify
0.70
identifying
0.70
Identifying
0.66
Identification
0.66
identifier
0.65
identify
0.64
Identifying
0.64
traceability
0.62
Identify
0.61
Activations Density 0.379%