INDEX
Explanations
numerical data related to personality assessments or psychological metrics
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.10
0.3%
1005
+0.09
0.3%
1343
+0.09
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1343
+0.10
0.06
1978
+0.09
0.06
1925
+0.09
0.05
Negative Logits
<bos>
-2.17
betweenstory
-0.85
Atsauces
-0.78
scalatest
-0.77
dafx
-0.76
'\\;'
-0.73
adaptiveStyles
-0.72
LookAnd
-0.72
ynb
-0.71
Chham
-0.71
POSITIVE LOGITS
elena
0.89
santiago
0.81
affor
0.80
sophie
0.80
strick
0.79
ricardo
0.79
alre
0.78
felipe
0.78
eduardo
0.77
sebastian
0.77
Activations Density 0.129%