INDEX
Explanations
references to Switzerland, Denmark, or related national identifiers
New Auto-Interp
Negative Logits
Swiss
-0.78
Swiss
-0.70
swiss
-0.60
americas
-0.57
Danish
-0.54
America
-0.54
America
-0.53
swiss
-0.52
Danish
-0.50
AsUp
-0.50
POSITIVE LOGITS
Switzerland
0.98
Efq
0.88
Ανακτήθηκε
0.85
Monfieur
0.81
Switzerland
0.75
tartalomajánló
0.74
Theſe
0.73
Denmark
0.72
оригіналу
0.72
myſelf
0.72
Activations Density 0.016%