INDEX
Explanations
references to Irish culture or people
New Auto-Interp
Neuron Alignment
Index
Value
% of L₁
421
+0.20
0.7%
61
+0.14
0.5%
172
+0.13
0.5%
Correlated Neurons
Index
P. Corr.
Cos Sim.
421
+0.20
0.04
172
+0.14
0.03
61
+0.13
0.03
Negative Logits
disambiguazione
-0.63
UnusedPrivate
-0.56
kasarigan
-0.55
Preço
-0.55
Tē
-0.55
Paglinawan
-0.54
RectangleBorder
-0.54
Τι
-0.54
vindo
-0.53
NKC
-0.53
POSITIVE LOGITS
Irish
1.30
irish
1.28
Ireland
1.21
Irish
1.18
ireland
1.14
Ireland
1.10
IRELAND
1.08
Irishman
1.08
jaya
0.99
intersper
0.97
Activations Density 0.095%