INDEX
Explanations
expressions of gratitude and feedback in a formal or positive manner
Neuron Alignment
Index   Value   % of L₁
50      +0.18   1.1%
1296    +0.11   0.6%
31      +0.11   0.6%
Correlated Neurons
Index   P. Corr.   Cos Sim.
667     +0.18      0.05
1296    +0.11      0.04
31      +0.11      0.04
Negative Logits
<bos>     -3.30
<?        -0.82
          -0.81
/***      -0.75
ⓧ         -0.75
/**       -0.74
<?        -0.73
deinit    -0.71
/*!       -0.70
bzero     -0.65
Positive Logits
Minang      1.61
bandung     1.47
thut        1.44
mef         1.44
stockholm   1.39
maneu       1.38
Juf         1.37
Jambi       1.36
increa      1.35
Banjar      1.35
Activations Density 0.141%