INDEX
Explanations
phrases related to controlling or "letting" something happen, especially in regards to ownership or responsibility
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
869
+0.09
0.3%
1372
+0.09
0.2%
1105
+0.07
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
869
+0.09
0.04
2026
+0.09
0.04
1685
+0.07
0.03
Negative Logits
soudain
-0.70
:,,
-0.69
quares
-0.69
parlant
-0.68
proposée
-0.66
imprimée
-0.65
foon
-0.64
fordable
-0.64
vierge
-0.64
matel
-0.63
POSITIVE LOGITS
continue
0.57
postIndex
0.57
allow
0.52
proceed
0.52
freely
0.52
let
0.50
happen
0.49
settle
0.48
succede
0.48
procede
0.48
Activations Density 0.256%