INDEX
Explanations
the word "Norwegian" in the text
references to Norway and its cultural or governmental contexts
Negative Logits
Token       Logit
place       -0.80
ually       -0.80
ual         -0.80
plain       -0.80
ept         -0.79
BOOK        -0.74
xxxxxxxx    -0.74
###         -0.72
uers        -0.72
vP          -0.70
Positive Logits
Token       Logit
Bok          1.07
Norway       1.05
Oslo         0.98
Norwegian    0.97
wegian       0.93
Refugee      0.84
Andersen     0.82
borg         0.81
Stockholm    0.80
å            0.76
Activations Density 0.008%
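
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed, assuming "activation density" means the fraction of token positions on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and do not come from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: density is counted per token position; the dashboard's exact
    definition is not stated in the source.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: the feature fires on 8 of 100,000 token positions.
sample = [0.0] * 99_992 + [1.3] * 8
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.008%
```

A density this low suggests the feature fires on very few tokens, which fits an explanation as narrow as mentions of Norway or the word "Norwegian".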