INDEX
Explanations
specific parts or components specified in a text
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.16
0.9%
866
+0.14
0.8%
25
+0.13
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
866
+0.16
0.05
25
+0.14
0.05
131
+0.13
0.04
Negative Logits
<bos>
-3.17
/***
-0.81
eliminate
-0.66
/*!
-0.65
<?
-0.63
///**
-0.63
/*
-0.63
-0.63
consolidate
-0.61
ⓧ
-0.60
POSITIVE LOGITS
stockholm
1.14
lidl
1.10
mef
1.10
aen
1.09
Parts
1.04
prétend
1.03
maroc
1.03
wien
1.03
lele
1.02
catég
1.02
Activations Density 0.122%