INDEX
Explanations
phrases related to a specific context or narrative, perhaps related to gaming or technology
Neuron Alignment
Index   Value   % of L₁
50      +0.16   0.9%
122     +0.06   0.3%
47      +0.06   0.3%
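The "% of L₁" column is not defined on this page. One plausible reading, stated here as an assumption, is that it reports each neuron's absolute value as a share of the L₁ norm (the sum of absolute values) of the whole vector. A minimal sketch of that reading follows; the function name `l1_share` and the toy vector are hypothetical and are not taken from this feature's data.

```python
def l1_share(weights):
    """Return each entry's absolute value as a percentage of the vector's L1 norm.

    Assumption: this mirrors what a "% of L1" column could mean; the page
    itself does not define the column.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        # An all-zero vector has no meaningful shares.
        return [0.0 for _ in weights]
    return [100.0 * abs(w) / l1_norm for w in weights]


# Hypothetical toy vector, not the feature's real weights.
toy_weights = [0.5, -0.25, 0.25]
shares = l1_share(toy_weights)
print(shares)
```

Under this reading, a share below 1% for the top-ranked neuron would indicate that the vector's mass is spread over many neurons rather than concentrated in a few.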
Correlated Neurons
Index   P. Corr.   Cos Sim.
47      +0.16      0.07
122     +0.06      0.06
416     +0.06      0.06
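Assuming "P. Corr." abbreviates Pearson correlation and "Cos Sim." cosine similarity (the page does not spell either out), the two measures differ only in whether the vectors are mean-centered first. The sketch below illustrates that relationship on hypothetical toy vectors; it does not reproduce how this dashboard computes its values, and the function names are illustrative.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation, computed as the cosine similarity of mean-centered vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    centered_a = [x - mean_a for x in a]
    centered_b = [y - mean_b for y in b]
    return cosine_similarity(centered_a, centered_b)


# Hypothetical toy data: b is a + 10, so the two are perfectly correlated,
# but the constant offset changes the angle between the raw vectors.
a = [1.0, 2.0, 3.0]
b = [11.0, 12.0, 13.0]
print(pearson_correlation(a, b), cosine_similarity(a, b))
```

Because Pearson correlation ignores constant offsets while cosine similarity does not, the two columns can legitimately disagree for the same neuron.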
Negative Logits
<bos>     -2.35
public    -0.79
ⓧ         -0.76
<?        -0.76
//        -0.73
ोंने        -0.73
/**       -0.72
(blank)   -0.70
///       -0.69
,         -0.69
Positive Logits
stockholm   1.66
maneu       1.60
accla       1.57
fta         1.56
affor       1.55
Juf         1.55
increa      1.55
viciss      1.53
effe        1.52
ftu         1.52
Activations Density 0.085%