INDEX
Explanations
references to emotional moments or states
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1450
+0.09
0.2%
1662
+0.07
0.2%
695
+0.06
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1450
+0.09
0.02
1081
+0.07
0.04
1525
+0.06
0.03
Negative Logits
shenan
-1.36
unspeak
-1.33
depic
-1.30
reluct
-1.29
apprehen
-1.28
disagre
-1.28
hentai
-1.25
impra
-1.23
intersper
-1.23
maneu
-1.19
POSITIVE LOGITS
everywhere
0.72
uska
0.60
betweenstory
0.58
Walkover
0.58
hidden
0.57
onViewCreated
0.56
apellidos
0.55
hidden
0.55
actionMode
0.55
neté
0.55
Activations Density 0.452%