INDEX
Explanations
instructions related to printing documents
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.18
1.0%
410
+0.11
0.6%
1909
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
410
+0.18
0.04
789
+0.11
0.03
1425
+0.10
0.03
Negative Logits
<bos>
-3.22
ⓧ
-0.83
-0.77
<?
-0.72
Vegeu
-0.69
/**
-0.67
springfox
-0.66
/*
-0.63
/***
-0.63
Unmarshaller
-0.59
POSITIVE LOGITS
impra
1.52
maneu
1.46
affor
1.45
increa
1.41
emphat
1.39
reluct
1.35
unspeak
1.31
unlaw
1.31
exorbit
1.31
disagre
1.30
Activations Density 0.086%