INDEX
Explanations
occurrences of the pronoun "you."
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.48
2.0%
2019
+0.10
0.4%
204
+0.10
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1919
+0.48
0.19
1415
+0.10
0.13
331
+0.10
0.10
Negative Logits
<bos>
-1.98
ã
-0.70
lomb
-0.66
MÁ
-0.65
Chá
-0.63
polie
-0.63
poliester
-0.63
bú
-0.62
universale
-0.61
ù
-0.60
POSITIVE LOGITS
disgra
0.91
accla
0.90
maneu
0.85
practition
0.81
chrysler
0.80
idéale
0.79
perfet
0.78
inev
0.77
heapq
0.76
reluct
0.76
Activations Density 0.802%