INDEX
Explanations
semicolons at the end of sentences
Neuron Alignment
Index   Value   % of L₁
50      +0.21   1.5%
2019    +0.08   0.6%
382     +0.07   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
411     +0.21      0.07
1596    +0.08      0.06
1509    +0.07      0.06
Negative Logits
Token     Logit
<bos>     -1.98
ⓧ         -1.07
(blank)   -1.06
<?        -1.03
/***      -1.03
/**       -1.00
<?        -0.93
/*        -0.81
/*!       -0.80
///**     -0.78
Positive Logits
Token        Logit
véhic        1.02
maroc        0.97
alté         0.93
catég        0.93
délib        0.88
Minang       0.86
°;           0.86
habile       0.85
pleins       0.85
expériment   0.85
Activations Density: 0.188%