INDEX
Explanations
situations involving legal disputes and personal conflict
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.31
1.1%
1843
+0.12
0.4%
946
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1919
+0.31
0.13
1843
+0.12
0.06
1097
+0.11
0.07
Negative Logits
<bos>
-3.34
<?
-0.68
InvalidProtocol
-0.65
-0.65
chè
-0.64
/*!
-0.63
Ceinture
-0.62
ftagPool
-0.60
/**
-0.60
springfox
-0.59
POSITIVE LOGITS
accla
1.35
maneu
1.35
resear
1.32
reluct
1.29
Juf
1.20
impra
1.19
unspeak
1.16
disgra
1.16
shenan
1.16
unve
1.15
Activations Density 0.736%