Explanations
phrases related to conflicts and dispute resolution
Neuron Alignment
Index    Value    % of L₁
50       +0.17    1.0%
411      +0.14    0.8%
1870     +0.13    0.8%
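The "% of L₁" column appears to give each neuron's share of the L₁ norm of the feature's direction vector, that is, |value| divided by the sum of absolute values over all neurons. The dashboard does not state this definition, so the sketch below is an assumption about how such a table could be computed. The function name `top_aligned_neurons` and the toy vector are hypothetical and are not taken from the tool.

```python
import numpy as np

def top_aligned_neurons(direction, k=3):
    """Return (index, value, share of L1 norm) for the k largest-magnitude entries.

    Assumes "% of L1" means |value| / sum(|values|); this is a guess at the
    dashboard's definition, not a documented formula.
    """
    direction = np.asarray(direction, dtype=float)
    l1 = np.abs(direction).sum()
    # Sort indices by descending absolute value and keep the first k.
    order = np.argsort(-np.abs(direction))[:k]
    return [(int(i), float(direction[i]), float(abs(direction[i]) / l1)) for i in order]

# Toy vector for illustration only; not the real feature direction.
toy = np.array([0.5, -0.25, 0.125, 0.125])
rows = top_aligned_neurons(toy, k=2)
for idx, val, share in rows:
    print(f"{idx:<8} {val:+.2f}    {share:.1%}")
```

Under this reading, a value of +0.17 at 1.0% would imply a total L₁ norm of roughly 17 for the full vector.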
Correlated Neurons
Index    P. Corr.    Cos Sim.
411      +0.17       0.03
615      +0.14       0.02
860      +0.13       0.02
Negative Logits
<bos>      -3.33
(blank)    -0.93
<?         -0.83
/**        -0.70
ⓧ          -0.68
/*!        -0.66
#![        -0.61
/***       -0.60
add        -0.58
<?         -0.58
Positive Logits
stockholm    1.48
bandung      1.36
affor        1.35
kyo          1.35
jaya         1.30
aen          1.28
wien         1.26
Minang       1.24
conflic      1.23
haup         1.23
Activations Density 0.068%