Explanations
phrases related to technology and software troubleshooting
Neuron Alignment
Index    Value    % of L₁
50       +0.12    0.5%
375      +0.07    0.3%
1013     +0.07    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
1919     +0.12       0.07
1197     +0.07       0.05
1665     +0.07       0.07
Negative Logits
<bos>                -2.95
(token not shown)    -0.93
/**                  -0.90
Autoritní            -0.86
ⓧ                    -0.83
<?                   -0.74
/*                   -0.71
/***                 -0.67
ándor                -0.65
}],                  -0.65
Positive Logits
meis       2.01
lele       1.72
kaos       1.68
ohr        1.66
franz      1.63
Kategor    1.62
kram       1.61
aen        1.58
mef        1.58
wien       1.56
Activations Density: 1.353%