INDEX
Explanations
words related to specific events, locations, and names, potentially related to a legal or criminal context
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1331
+0.17
0.7%
1573
+0.17
0.7%
1034
+0.15
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1331
+0.17
0.04
1573
+0.17
0.03
404
+0.15
0.03
Negative Logits
awaran
-0.47
Puritans
-0.47
Greensboro
-0.46
yelidikan
-0.45
siyang
-0.45
Cormack
-0.44
visející
-0.44
empatan
-0.43
コマ
-0.43
millan
-0.43
POSITIVE LOGITS
Cal
1.42
cal
1.36
Cal
1.35
CAL
1.33
cal
1.24
CAL
1.15
Kalifor
1.03
Kal
1.00
Cali
0.99
cale
0.98
Activations Density 0.090%