INDEX
Explanations
contractions of "can not" and "can"
Neuron Alignment
Index   Value   % of L₁
674     +0.12   0.4%
381     +0.10   0.3%
1974    +0.10   0.3%
Correlated Neurons
Index   P. Corr.   Cos Sim.
1415    +0.12      0.06
1974    +0.10      0.06
1124    +0.10      0.06
Negative Logits
Américas     -0.62
bakso        -0.56
ruinas       -0.56
Résultats    -0.54
Catedral     -0.54
parlamento   -0.53
churras      -0.53
Formazione   -0.52
Aé           -0.52
levure       -0.52
Positive Logits
disreg       0.93
affor        0.82
             0.78
reluct       0.75
impra        0.73
resear       0.73
increa       0.71
suscep       0.70
pollut       0.69
excru        0.69
Activations Density 0.202%