INDEX
Explanations
verbs or expressions related to recognition, especially regarding actions or behaviors
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
889
+0.13
0.4%
1265
+0.11
0.4%
1023
+0.10
0.3%
Correlated Neurons
Index
P. Corr.
Cos Sim.
889
+0.13
0.02
1142
+0.11
0.02
1023
+0.10
0.02
Negative Logits
setlength
-0.54
Produzione
-0.53
zanah
-0.50
strix
-0.49
assas
-0.49
ibatkan
-0.48
phur
-0.48
interessanti
-0.46
Morfologia
-0.46
*)((
-0.46
POSITIVE LOGITS
acknowledge
1.06
acknowledging
1.06
acknowledgment
1.00
acknowledgement
0.99
Acknowled
0.99
acknowledged
0.98
acknowled
0.93
Acknowledge
0.92
acknowledges
0.90
apprehen
0.88
Activations Density 0.054%