INDEX
Explanations
mentions of specific geographical locations or landmarks
references to specific geographical locations or entities, particularly islands
Negative Logits
ents      -0.72
iously    -0.68
SG        -0.68
ital      -0.67
1959      -0.65
elaide    -0.65
lished    -0.65
[+        -0.65
ENTS      -0.63
ellar     -0.63
Positive Logits
Isle      1.37
Isles     1.01
Royale    0.96
Enix      0.81
ortment   0.80
sburg     0.76
mania     0.75
wright    0.73
pload     0.73
ignt      0.72
Activations Density: 0.013%