INDEX
Explanations
mentions of actions or potential states, especially in relation to personal experiences
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.21
1.3%
1896
+0.10
0.7%
1034
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
537
+0.21
0.06
1124
+0.10
0.05
1264
+0.10
0.06
Negative Logits
<bos>
-3.46
AssemblyCompany
-0.75
ⓧ
-0.74
<?
-0.69
/***
-0.68
EndProject
-0.66
addComponent
-0.66
/*!
-0.64
HideFlags
-0.63
GTCX
-0.63
POSITIVE LOGITS
affor
1.35
chrysler
1.22
milf
1.21
maneu
1.21
pollut
1.21
lidl
1.19
impra
1.15
practition
1.13
accla
1.11
lamborghini
1.08
Activations Density 0.173%