INDEX
Explanations
content that reflects social and political systems, particularly related to governance and civic responsibility
New Auto-Interp
Head Attr Weights
0:0.01
1:0.02
2:0.08
3:0.09
4:0.32
5:0.02
6:0.10
7:0.12
8:0.04
9:0.04
10:0.06
11:0.05
Negative Logits
axter
-1.52
oliath
-1.48
ecided
-1.48
eared
-1.47
endon
-1.42
ignty
-1.41
achev
-1.41
wat
-1.40
bothers
-1.39
danger
-1.37
POSITIVE LOGITS
Plex
1.59
nets
1.39
REST
1.39
Waves
1.36
supplemented
1.34
vectors
1.33
Notting
1.33
mechanisms
1.32
tunnels
1.29
corridors
1.28
Activations Density 0.064%