INDEX
Explanations
references to locations or geographical territories
references to geographical or political territories
Negative Logits
odcast     -0.83
racted     -0.78
oster      -0.78
uster      -0.73
Case       -0.73
uth        -0.71
Bi         -0.70
asting     -0.68
Episode    -0.68
kus        -0.67
Positive Logits
boundaries     0.90
conquered      0.89
Territories    0.86
territory      0.83
territories    0.83
Territory      0.79
inhabited      0.77
borders        0.77
bordering      0.76
sovereignty    0.75
Activations Density: 0.025%