INDEX
Explanations
emotional expressions conveying love and relationships
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.40
1.7%
2019
+0.21
0.9%
736
+0.19
0.8%
Correlated Neurons
Index
P. Corr.
Cos Sim.
2019
+0.40
0.10
1959
+0.21
0.09
736
+0.19
0.07
Negative Logits
<bos>
-1.82
Kategor
-1.71
télécharge
-1.64
karton
-1.61
kompati
-1.61
Secrétaire
-1.59
Konkur
-1.58
kosme
-1.58
biograf
-1.57
stoff
-1.54
POSITIVE LOGITS
pecuniary
0.84
rehensive
0.79
tiously
0.79
gewohnt
0.75
materialistic
0.75
snart
0.74
rictions
0.74
unfore
0.74
carbons
0.73
barbacoa
0.73
Activations Density 0.603%