INDEX
Explanations
notifications or alerts posted within a website or platform
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.20
1.1%
1677
+0.09
0.5%
1008
+0.09
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1008
+0.20
0.05
1861
+0.09
0.05
1677
+0.09
0.04
Negative Logits
<bos>
-3.15
ⓧ
-1.17
<?
-1.05
/**
-1.01
/*
-0.85
/*!
-0.84
-0.80
/***
-0.79
springfox
-0.75
///**
-0.74
POSITIVE LOGITS
maneu
1.87
increa
1.81
affor
1.80
impra
1.77
lele
1.73
bandung
1.73
stockholm
1.72
strick
1.67
wien
1.65
unspeak
1.63
Activations Density 0.161%