INDEX
Neuron Alignment
Index   Value   % of L₁
50      +0.18   1.0%
172     +0.14   0.8%
1480    +0.11   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
36      +0.18      0.04
1480    +0.14      0.03
1235    +0.11      0.03
Negative Logits
<bos>   -3.14
/***    -0.91
<?      -0.73
///**   -0.73
        -0.69
//---   -0.67
/*      -0.66
#       -0.66
/*!     -0.65
ⓧ       -0.65
Positive Logits
volunte     1.78
ecru        1.75
fortn       1.70
impra       1.60
unlaw       1.60
affor       1.59
ibiza       1.58
maneu       1.58
madonna     1.53
tolerably   1.52
Activations Density 0.084%