INDEX
Explanations
descriptions of past experiences with TV shows or movies
Neuron Alignment
Index    Value    % of L₁
1013     +0.09    0.3%
393      +0.09    0.2%
1415     +0.08    0.2%
Correlated Neurons
Index    P. Corr.    Cos Sim.
393      +0.09       0.06
1415     +0.09       0.02
458      +0.08       0.06
Negative Logits
regardé      -0.62
userEmail    -0.55
postData     -0.53
purcha       -0.52
michelin     -0.51
viendra      -0.50
constaté     -0.50
préfé        -0.50
écout        -0.50
userModel    -0.50
Positive Logits
revival        1.02
revived        0.98
revive         0.94
resurrected    0.90
redis          0.85
reviving       0.85
resurgence     0.83
revi           0.83
resurrect      0.82
resur          0.80
Activations Density 0.904%