INDEX
Explanations
references to historical and political events related to colonialism and nationalism
Negative Logits
ispers        -0.72
eworks        -0.70
ancies        -0.66
ravings       -0.66
doms          -0.65
ptions        -0.64
interviews    -0.63
tails         -0.63
stamps        -0.62
asions        -0.62
Positive Logits
conduit         0.74
spokesperson    0.72
contender       0.71
entity          0.71
sleeper         0.70
whore           0.69
affair          0.68
underdog        0.67
fighter         0.66
organism        0.66
Activations Density: 14.413%