Explanations
References to nationalities, particularly German culture and language within a European context.
Negative Logits
Token            Logit
CreateTagHelper  -0.71
للاسماء          -0.62
estekak          -0.62
ब्रेकडाउन         -0.60
Савезне          -0.57
Personensuche    -0.57
noDo             -0.57
WriteTagHelper   -0.55
Infórmanos       -0.54
sumpay           -0.54

Positive Logits
Token    Logit
German   0.62
French   0.58
Germany  0.57
German   0.57
Dutch    0.53
Germany  0.52
French   0.51
german   0.51
GERMAN   0.50
France   0.50

Activations Density: 0.570%