Explanations
statements about accuracy, correctness, and compliance
Neuron Alignment
Index   Value   % of L₁
50      +0.20   1.0%
605     +0.09   0.5%
1013    +0.09   0.5%
Correlated Neurons
Index   P. Corr.   Cos Sim.
605     +0.20      0.06
1013    +0.09      0.14
468     +0.09      0.08
Negative Logits
<bos>          -3.40
/***           -0.87
               -0.87
displayquote   -0.81
/*             -0.76
/**            -0.76
<tfoot>        -0.73
/*!            -0.73
Kontrola       -0.72
//---          -0.72
Positive Logits
maroc       1.45
lidl        1.42
riviera     1.41
stockholm   1.33
milano      1.29
paradiso    1.29
bordeaux    1.29
outlander   1.29
matel       1.26
Meksi       1.26
Activations Density: 1.774%