INDEX
Explanations
mentions of the country "Norway"
mentions of Norway and its derivatives
Negative Logits
ual         -0.80
place       -0.79
uers        -0.78
aging       -0.77
ually       -0.77
ept         -0.75
plain       -0.74
xxxxxxxx    -0.73
uring       -0.71
enance      -0.70
Positive Logits
Bok          1.03
borg         0.95
Norway       0.93
wegian       0.92
Refugee      0.91
Norwegian    0.85
Oslo         0.84
Airlines     0.81
Dag          0.78
Ń·           0.75
Activations Density 0.016%