Explanations
instances of the word "now" occurring in a text
Neuron Alignment
Index    Value    % of L₁
50       +0.24    1.3%
1742     +0.12    0.6%
1671     +0.11    0.6%
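The "% of L₁" column is not defined on this page. One plausible reading is that it reports each neuron's absolute weight as a share of the L1 norm (the sum of absolute values) of the feature's full weight vector. The sketch below illustrates that reading only. The function name `l1_share` and the toy vector are hypothetical and are not taken from the dashboard.

```python
def l1_share(weights, index):
    """Return |weights[index]| as a fraction of the L1 norm of `weights`.

    Assumption: "% of L1" means absolute value divided by the sum of
    absolute values over the whole vector. The page does not confirm this.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; the share is undefined")
    return abs(weights[index]) / l1_norm


# Toy vector for illustration, not the feature's real weights.
toy_weights = [0.5, -0.25, 0.25]
shares = [l1_share(toy_weights, i) for i in range(len(toy_weights))]
```

Under this reading, a small percentage for the top-ranked neuron would mean the feature's weight is spread thinly across many neurons and is not concentrated in a few.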
Correlated Neurons
Index    Pearson Corr.    Cos Sim.
1742     +0.24            0.09
1671     +0.12            0.07
47       +0.11            0.06
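The Correlated Neurons table reports two different similarity measures per neuron, read here as Pearson correlation and cosine similarity. As a rough illustration of how the two differ, the sketch below computes both in plain Python. It assumes the correlation is taken over paired activation values and the cosine similarity over two direction vectors. The page states neither, and the helper names `pearson` and `cosine` are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (mean-centred)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (no mean-centring)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Toy inputs for illustration only.
r_same_trend = pearson([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
r_opposite = pearson([1.0, 2.0, 3.0], [3.0, 2.0, 1.0])
cos_orthogonal = cosine([1.0, 0.0], [0.0, 1.0])
```

Because Pearson correlation subtracts the means and cosine similarity does not, the two columns can rank neurons differently even when computed on the same data.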
Negative Logits
Token              Logit
<bos>              -3.35
export             -0.63
Kontrola           -0.61
beans              -0.61
import             -0.61
public             -0.61
commit             -0.60
円                 -0.58
SequentialGroup    -0.58
interface          -0.58
Positive Logits
Token          Logit
affor          1.85
maneu          1.85
unlaw          1.74
accla          1.68
hairc          1.68
stockholm      1.66
emphat         1.63
lidl           1.62
lamborghini    1.61
disreg         1.60
Activations Density: 0.279%