INDEX
Explanations
mention of specific roles or activities related to a community or organization
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1896
+0.12
0.7%
1363
+0.12
0.6%
478
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1363
+0.12
0.11
1385
+0.12
0.14
1013
+0.10
0.15
Negative Logits
<bos>
-2.34
char
-0.96
pad
-0.94
Pad
-0.93
Rod
-0.93
str
-0.91
pop
-0.91
trim
-0.90
Pa
-0.89
string
-0.88
POSITIVE LOGITS
stockholm
2.54
eiffel
2.37
ecru
2.35
madonna
2.32
outlander
2.30
riviera
2.29
venice
2.26
milf
2.26
jurassic
2.26
bordeaux
2.26
Activations Density 2.668%