Explanations
words related to joy and positive emotions
Neuron Alignment
Index    Value    % of L₁
1839     +0.15    0.8%
50       +0.14    0.7%
1406     +0.12    0.6%
Correlated Neurons
Index    Pearson Corr.    Cosine Sim.
1839     +0.15            0.09
869      +0.14            0.07
596      +0.12            0.06
Negative Logits
Token              Logit
<bos>              -3.31
public             -0.75
HasColumnType      -0.69
protected          -0.65
묶                 -0.64
SequentialGroup    -0.63
marshaller         -0.62
/*                 -0.60
/**                -0.60
private            -0.60
Positive Logits
Token        Logit
jaya         1.75
wien         1.66
stockholm    1.64
Minang       1.62
bandung      1.60
thut         1.59
increa       1.57
aen          1.57
Juf          1.56
impra        1.56
Activations Density: 0.759%