INDEX
Explanations
phrases related to technical skills, education, and training
Neuron Alignment
Index   Value   % of L₁
1984    +0.13   0.4%
860     +0.11   0.3%
1403    +0.09   0.3%
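One plausible reading of the "% of L₁" column, offered here as an assumption rather than the dashboard's documented definition, is each neuron's absolute weight divided by the L1 norm of the feature's full direction vector. A minimal sketch on a made-up vector (the function name `l1_share` and the toy data are hypothetical):

```python
def l1_share(weights, index):
    """Return |weights[index]| as a fraction of the L1 norm of weights."""
    l1_norm = sum(abs(w) for w in weights)
    if l1_norm == 0:
        raise ValueError("L1 norm is zero; the share is undefined")
    return abs(weights[index]) / l1_norm


# Hypothetical toy vector, not data taken from the dashboard.
toy_weights = [0.5, -0.25, 0.25, 0.0]
shares = [l1_share(toy_weights, i) for i in range(len(toy_weights))]
```

Under this reading, the shares over all neurons sum to 1, so a top neuron holding only 0.4% would indicate that the feature is spread thinly across many neurons.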
Correlated Neurons
Index   P. Corr.   Cos Sim.
1984    +0.13      0.04
860     +0.11      0.04
921     +0.09      0.03
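The two columns above measure different things, which may explain why they diverge. The sketch below shows the general relationship between Pearson correlation and cosine similarity on toy vectors. The dashboard does not state which vectors it compares, so the inputs here are purely illustrative and the function names are hypothetical:

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def pearson_correlation(a, b):
    """Pearson correlation, computed as cosine similarity of the mean-centered vectors."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    centered_a = [x - mean_a for x in a]
    centered_b = [y - mean_b for y in b]
    return cosine_similarity(centered_a, centered_b)


# Toy data (assumption): b has the same trend as a with opposite-signed entries.
a = [1.0, 2.0, 3.0]
b = [-3.0, -2.0, -1.0]
corr = pearson_correlation(a, b)
cos = cosine_similarity(a, b)
```

Because Pearson correlation subtracts the mean first, two vectors can be perfectly correlated while pointing in quite different directions, as `a` and `b` do here.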
Negative Logits
Hitam       -0.74
Kategor     -0.71
frambo      -0.71
jawa        -0.67
papà        -0.67
jaya        -0.67
moiselle    -0.67
bambou      -0.66
héro        -0.66
maske       -0.66
Positive Logits
Sebagai        0.90
result         0.84
consequence    0.77
Și             0.71
sebagai        0.68
Jako           0.66
reminder       0.63
result         0.60
Dacă           0.59
Gdy            0.58
Activations Density 0.096%