Explanations
words related to immigration and immigrant policies
Neuron Alignment
Index   Value   % of L₁
50      +0.25   1.4%
1870    +0.13   0.7%
1178    +0.12   0.7%
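A minimal sketch of how a "% of L₁" column could be computed, assuming it means the absolute weight on one neuron divided by the sum of absolute weights over all neurons (the L₁ norm). The function name `l1_share` and the sample vector are hypothetical and not taken from the dashboard.

```python
def l1_share(weights, index):
    """Return |weights[index]| as a fraction of the vector's L1 norm.

    Assumption: "% of L1" is this fraction expressed as a percentage.
    """
    total = sum(abs(w) for w in weights)  # L1 norm of the whole vector
    if total == 0:
        raise ValueError("weight vector has zero L1 norm")
    return abs(weights[index]) / total


# Hypothetical three-neuron weight vector, for illustration only.
example = [0.25, -0.5, 0.25]
share = l1_share(example, 0)
print(f"{share:.1%}")
```

Under that assumption, the rows above would imply an L₁ norm of roughly 18 (0.25 / 0.014 ≈ 17.9), though the rounded values leave some slack.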
Correlated Neurons
Index   P. Corr.   Cos Sim.
1178    +0.25      0.03
1634    +0.13      0.02
486     +0.12      0.02
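The two columns likely measure different things, which may be why a neuron can show a noticeable correlation alongside a near-zero cosine similarity. The sketch below assumes "P. Corr." is the Pearson correlation between two activation series and "Cos Sim." is the cosine similarity between two weight vectors. Both function names and all inputs are hypothetical.

```python
import math


def pearson(xs, ys):
    """Pearson correlation of two equal-length series (mean-centered)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (no centering)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Hypothetical activations of a feature and a neuron over three tokens.
print(pearson([1.0, 2.0, 3.0], [2.0, 4.0, 6.0]))
# Hypothetical weight directions that are orthogonal.
print(cosine([1.0, 0.0], [0.0, 1.0]))
```

Pearson correlation compares how two activation series co-vary across inputs, while cosine similarity compares the directions of two fixed vectors, so the two numbers need not agree.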
Negative Logits
<bos>             -2.88
ⓧ                 -0.83
<?                -0.81
uxxxx             -0.79
/**               -0.71
LookAnd           -0.69
JspWriter         -0.65
AssemblyCompany   -0.63
#                 -0.62
                  -0.60
Positive Logits
bandung       1.27
jaya          1.20
wien          1.17
Immigration   1.16
🤣🤣          1.14
véhic         1.13
Hæ            1.09
immigration   1.09
Præ           1.08
soulign       1.08
Activations Density 0.029%