INDEX
Explanations
time-related phrases, particularly related to spending time or going through books
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.22
0.9%
1013
+0.10
0.4%
1413
+0.09
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1013
+0.22
0.06
369
+0.10
0.04
1041
+0.09
0.04
Negative Logits
<bos>
-2.69
ⓧ
-0.80
/**
-0.79
addCriterion
-0.66
add
-0.66
<?
-0.64
class
-0.61
-0.59
setClass
-0.59
regulate
-0.58
POSITIVE LOGITS
lamborghini
1.26
venice
1.24
reft
1.23
perfon
1.23
fatis
1.22
chrysler
1.22
fays
1.21
errone
1.20
dises
1.20
ftu
1.19
Activations Density 0.420%