INDEX
Explanations
occurrences of the full form of words with emphasis
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
297
+0.15
0.5%
577
+0.14
0.5%
1387
+0.11
0.4%
Correlated Neurons
Index
P. Corr.
Cos Sim.
577
+0.15
0.05
297
+0.14
0.05
1387
+0.11
0.04
Negative Logits
تعدى
-0.44
کات
-0.43
spake
-0.43
іде
-0.41
зробити
-0.41
الها
-0.40
ed
-0.40
Arrived
-0.40
abstractmethod
-0.40
близь
-0.39
POSITIVE LOGITS
Full
0.98
FULL
0.97
FULL
0.97
full
0.96
Full
0.93
full
0.88
getFull
0.86
affez
0.80
isFull
0.80
GRATIS
0.78
Activations Density 0.107%