INDEX
Explanations
references to immigration and related policies
Negative Logits
Token       Logit
sonian      -0.92
ilts        -0.87
çͰ          -0.81
olls        -0.81
@#&         -0.79
Ü           -0.78
oeuv        -0.74
idges       -0.73
beit        -0.73
MpServer    -0.72
Positive Logits
Token         Logit
detention     1.06
reform        0.98
enforcement   0.94
policy        0.91
immigration   0.89
crackdown     0.87
policy        0.87
deportation   0.86
policies      0.86
visas         0.86
Activations Density: 0.014%