INDEX
Explanations
phrases related to collaboration, consultation, and working with specific groups or entities
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
939
+0.10
0.3%
1120
+0.10
0.3%
1499
+0.09
0.2%
Correlated Neurons
Index
P. Corr.
Cos Sim.
16
+0.10
0.05
946
+0.10
0.04
939
+0.09
0.05
Negative Logits
overcrow
-0.88
horrend
-0.82
impra
-0.81
underval
-0.80
;;)
-0.80
shenan
-0.80
indescri
-0.79
inconce
-0.76
ineffec
-0.75
disreg
-0.75
POSITIVE LOGITS
principalColumn
0.72
principalTable
0.68
WriteTagHelper
0.64
مرئيه
0.64
Portail
0.61
@[+][
0.58
للاسماء
0.58
récents
0.58
ivelany
0.58
सत्यापित
0.58
Activations Density 0.335%