INDEX
Explanations
references to the "Star Trek" franchise and its elements
Neuron Alignment
Index   Value   % of L₁
50      +0.21   0.8%
86      +0.10   0.4%
1035    +0.09   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
86      +0.21      0.03
304     +0.10      -0.00
227     +0.09      0.05
Negative Logits
Token             Logit
<bos>             -2.05
intptr            -0.75
public            -0.74
censiti           -0.73
EconPapers        -0.73
utafitiHapana     -0.69
<<<<<<<<<<<<<<    -0.68
HideFlags         -0.66
IVEREF            -0.65
mergeFrom         -0.65
Positive Logits
Token       Logit
increa      1.88
affor       1.85
emphat      1.77
excru       1.73
ecru        1.65
perfet      1.65
disagre     1.62
milf        1.61
hairc       1.61
unwarran    1.61
Activations Density 0.284%