Explanations
references to citizenship and legal status
Negative Logits
Token           Logit
متعلقه          -0.59
θρω             -0.59
+#+#            -0.47
interviewers    -0.47
Farewell        -0.47
guys            -0.47
ーブ            -0.47
interviewer     -0.46
Jungs           -0.45
platte          -0.44
Positive Logits
Token           Logit
citizen         1.04
citoyen         0.98
resident        0.95
citizens        0.93
citizen         0.92
citizens        0.89
Citizen         0.88
citoyens        0.88
Citizens        0.87
citoy           0.85
Activations Density 0.620%