INDEX
Explanations
proper nouns, specifically the name "Holland."
mentions of the name "Holland."
Negative Logits
Token        Logit
culation     -0.87
ctic         -0.80
ramid        -0.77
ãĥ£          -0.76
ceed         -0.76
vae          -0.76
ccording     -0.74
itutes       -0.74
ormal        -0.73
notations    -0.73
Positive Logits
Token        Logit
Holland      0.98
inson        0.91
shire        0.90
stadt        0.84
Gardens      0.79
Gaal         0.76
Cob          0.75
ijn          0.74
Tunnel       0.73
Wink         0.73
Activations Density: 0.010%