INDEX
Explanations
the concept of transformation or change, particularly in relation to processes or paths
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
292
+0.14
0.8%
61
+0.13
0.7%
302
+0.13
0.7%
Correlated Neurons
Index
P. Corr.
Cos Sim.
154
+0.14
0.06
329
+0.13
0.06
14
+0.13
0.06
Negative Logits
prospective
-1.67
ely
-1.63
agment
-1.61
oved
-1.58
ocene
-1.51
current
-1.51
%%
-1.43
elastic
-1.42
ceptible
-1.39
autiful
-1.39
POSITIVE LOGITS
Bag
1.74
kie
1.68
ibles
1.63
iffs
1.58
night
1.49
ney
1.45
yards
1.45
ids
1.41
mans
1.35
drug
1.33
Activations Density 0.038%