INDEX
Explanations
links or descriptions related to accessing full content, such as images or articles
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.21
1.2%
1145
+0.11
0.7%
577
+0.10
0.6%
Correlated Neurons
Index
P. Corr.
Cos Sim.
297
+0.21
0.06
577
+0.11
0.06
899
+0.10
0.05
Negative Logits
<bos>
-3.08
<>
-0.77
///**
-0.74
/***
-0.73
/*!
-0.70
//---
-0.65
/**
-0.62
<?
-0.61
introduce
-0.60
//--
-0.60
POSITIVE LOGITS
stockholm
1.42
bandung
1.40
Minang
1.39
maneu
1.36
affor
1.36
frankfurt
1.30
unlaw
1.30
venuto
1.29
Juf
1.29
napoli
1.27
Activations Density 0.116%