INDEX
Explanations
commentary or reviews of movies or shows
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
50
+0.26
1.1%
906
+0.09
0.4%
382
+0.07
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1320
+0.26
0.10
906
+0.09
-0.01
322
+0.07
0.05
Negative Logits
<bos>
-2.35
/***
-0.81
ⓧ
-0.77
intersper
-0.75
ratify
-0.73
/**
-0.73
inaugurate
-0.72
defray
-0.70
springfox
-0.70
-0.68
POSITIVE LOGITS
âgé
0.84
originaire
0.79
soulign
0.72
épu
0.71
jouant
0.69
récomp
0.68
createSlice
0.67
bosco
0.67
espri
0.67
marié
0.67
Activations Density 1.850%