INDEX
Explanations
references to pain and related conditions
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
156
+0.19
1.2%
143
+0.14
0.8%
321
+0.12
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
143
+0.19
0.02
394
+0.14
0.02
361
+0.12
0.02
Negative Logits
hers
-2.05
yours
-1.94
vez
-1.77
mine
-1.63
ership
-1.59
geneal
-1.58
blogger
-1.52
trustworthy
-1.50
ours
-1.48
campus
-1.47
POSITIVE LOGITS
less
2.54
ting
2.18
ful
2.13
inflicted
2.06
lessly
2.06
relief
2.00
ingly
1.98
ĩ
1.91
lessness
1.84
faced
1.84
Activations Density 0.137%