INDEX
Explanations
information related to historical figures and events, especially those in literature and science
Neuron Alignment
Index   Value   % of L₁
50      +0.22   1.9%
2019    +0.13   1.1%
1699    +0.10   0.9%
Correlated Neurons
Index   P. Corr.   Cos Sim.
478     +0.22      0.07
1699    +0.13      0.23
1535    +0.10      0.02
Negative Logits
ⓧ           -1.51
<?          -1.31
            -1.31
/**         -1.30
<bos>       -1.29
<?          -1.09
/***        -1.06
/*          -1.03
springfox   -0.93
/*!         -0.88
Positive Logits
maneu       1.27
lele        1.24
maroc       1.21
affor       1.19
véhic       1.15
impra       1.12
Juf         1.11
accla       1.10
Intere      1.08
aen         1.08
Activations Density 2.478%