INDEX
Explanations
specific legal concepts and terminology related to citizenship and nationality
New Auto-Interp
Negative Logits
AMERICA
-0.83
ENGLAND
-0.83
england
-0.78
England
-0.77
america
-0.75
america
-0.74
Амери
-0.71
Inglaterra
-0.69
America
-0.69
Angleterre
-0.68
POSITIVE LOGITS
Japanese
1.35
Italian
1.35
Mexican
1.34
Brazilian
1.29
Canadian
1.25
German
1.22
Polish
1.21
Finnish
1.21
Chinese
1.20
Russian
1.18
Activations Density 0.798%