INDEX
Explanations
technical terms and abbreviations in a text related to technology or academia
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
1265
+0.09
0.3%
30
+0.09
0.3%
1527
+0.08
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
1834
+0.09
0.02
484
+0.09
0.02
1956
+0.08
0.02
Negative Logits
Eksteraj
-0.74
Biografía
-0.74
Ilustra
-0.74
Alguna
-0.73
Muerte
-0.70
Pued
-0.69
Todavía
-0.68
Història
-0.68
Conclusión
-0.67
Producción
-0.66
POSITIVE LOGITS
disagre
1.49
maneu
1.44
intersper
1.43
reluct
1.42
disreg
1.42
apprehen
1.40
impra
1.34
scrat
1.34
suscep
1.34
cushi
1.28
Activations Density 0.034%