INDEX
Explanations
phrases related to citations, quotes, and repeated phrases
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1842
+0.17
0.5%
1978
+0.15
0.5%
1177
+0.14
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1299
+0.17
0.03
1415
+0.15
0.03
1510
+0.14
0.04
Negative Logits
<bos>
-0.65
accountId
-0.65
shewn
-0.59
resultList
-0.59
dimensionless
-0.57
loggedIn
-0.55
bituminous
-0.55
discontinuities
-0.54
diffusivity
-0.53
rtn
-0.53
POSITIVE LOGITS
fantaisie
0.78
quoique
0.70
nôtre
0.70
rédig
0.67
fabriqué
0.66
plais
0.64
célé
0.63
prédé
0.63
RSSSF
0.62
confé
0.61
Activations Density 0.404%