INDEX
Explanations
words related to actions or commands
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
690
+0.15
0.5%
1510
+0.14
0.4%
468
+0.12
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
655
+0.15
0.08
690
+0.14
0.05
509
+0.12
0.08
Negative Logits
perciò
-0.98
poichè
-0.90
vuol
-0.85
specialmente
-0.85
purtroppo
-0.84
pertanto
-0.83
sappi
-0.81
apparti
-0.80
occorre
-0.79
persino
-0.78
POSITIVE LOGITS
resultList
0.75
newList
0.65
responseData
0.62
mList
0.60
.
0.59
jsonString
0.58
getID
0.57
rehensive
0.56
loggedIn
0.55
unless
0.54
Activations Density 0.667%