Explanations
mentions of specific brands or products
Neuron Alignment
Index   Value   % of L₁
1404    +0.18   1.0%
1464    +0.17   0.9%
1178    +0.16   0.9%
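The page does not define the "% of L₁" column. One plausible reading, assumed here, is each neuron's absolute value as a share of the L₁ norm (the sum of absolute values) of the whole vector. A minimal sketch of that calculation follows; the function name `percent_of_l1` and the toy vector are hypothetical and are not taken from this feature.

```python
def percent_of_l1(weights):
    """Return each entry's absolute value as a percentage of the vector's L1 norm.

    Assumption: "% of L1" means |w_i| / sum_j |w_j|, expressed in percent.
    """
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; percentages are undefined")
    return [100.0 * abs(w) / l1_norm for w in weights]


# Hypothetical toy vector for illustration only.
toy_weights = [0.5, -0.25, 0.25]
shares = percent_of_l1(toy_weights)
```

Under this reading the percentages over all entries sum to 100, so a top neuron at about 1% would indicate that the feature is spread across many neurons rather than aligned with a single one.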
Correlated Neurons
Index   P. Corr.   Cos Sim.
1343    +0.18      0.07
227     +0.17      0.07
1056    +0.16      0.05
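Assuming "P. Corr." denotes Pearson correlation and "Cos Sim." denotes cosine similarity, the two columns measure related but different things: Pearson correlation is the cosine similarity of the mean-centered vectors. The page does not say which vectors are compared, so the inputs below are hypothetical. A minimal sketch:

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation, computed as cosine similarity after mean-centering."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    centered_a = [x - mean_a for x in a]
    centered_b = [y - mean_b for y in b]
    return cosine_similarity(centered_a, centered_b)


# Hypothetical vectors: b is a shifted by a constant.
a = [1.0, 2.0, 3.0]
b = [2.0, 3.0, 4.0]
corr = pearson_correlation(a, b)
cos = cosine_similarity(a, b)
```

Because centering removes a constant offset, the shifted vector correlates perfectly with the original while its cosine similarity stays below 1. The two columns can therefore differ noticeably for the same pair of neurons.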
Negative Logits
Token             Logit
Scénario          -0.63
devemos           -0.61
vorrei            -0.59
zví               -0.59
poichè            -0.58
occorre           -0.57
dobbiamo          -0.56
confira           -0.56
esclusivamente    -0.55
randomUUID        -0.55
Positive Logits
Token             Logit
Hen               0.79
Hen               0.77
Bens              0.72
Bem               0.71
Ot                0.70
Mend              0.69
Otter             0.68
silikon           0.67
Nadine            0.66
Henne             0.66
Activations Density: 0.665%