INDEX
Explanations
proper nouns related to countries, with a specific focus on "Norway"
mentions of Norway and related terms or entities
New Auto-Interp
Negative Logits
ually
-0.97
ual
-0.83
place
-0.81
eenth
-0.80
ience
-0.79
uality
-0.79
uers
-0.78
icts
-0.76
uation
-0.76
icago
-0.75
POSITIVE LOGITS
wegian
0.90
wich
0.81
gger
0.73
Ô
0.70
borg
0.70
Bok
0.68
Cla
0.67
pheus
0.66
MAL
0.65
poon
0.65
Activations Density 0.056%