INDEX
Explanations
phrases related to policies or policy proposals
references to policy-related topics and discussions
Negative Logits
Token        Logit
Stain        -0.75
upon         -0.73
ITNESS       -0.73
────────     -0.71
Templ        -0.69
hma          -0.69
FORMATION    -0.66
batch        -0.66
ymes         -0.65
apy          -0.65
Positive Logits
Token            Logit
makers           1.02
prescriptions    1.01
making           0.96
advisors         0.91
makers           0.91
initiatives      0.89
policy           0.89
interventions    0.88
advisers         0.88
advisor          0.88
Activations Density: 0.035%