INDEX
Explanations
quotations from spokespeople
Neuron Alignment
Index   Value   % of L₁
50      +0.21   1.0%
667     +0.11   0.5%
1870    +0.09   0.4%
Correlated Neurons
Index   P. Corr.   Cos Sim.
321     +0.21      0.03
351     +0.11      0.02
667     +0.09      0.02
Negative Logits
<bos>     -2.80
(blank)   -1.06
ⓧ         -1.05
<?        -0.96
/**       -0.91
/*!       -0.90
/***      -0.89
<?        -0.87
///**     -0.79
/*        -0.78
Positive Logits
bandung         1.16
Minang          1.16
spokespersons   1.09
unlaw           1.08
maneu           1.07
jaya            1.06
affor           1.06
territo         1.06
spokesman       1.04
sovere          1.04
Activations Density 0.077%