INDEX
    Explanations

    terms related to policies, regulations, and assistance programs

    terms related to accountability and safeguards against harm

    New Auto-Interp
    Negative Logits
     Garfield
    -0.83
     Eliot
    -0.75
     Lovecraft
    -0.71
    aaaa
    -0.69
     thirteen
    -0.69
     Congratulations
    -0.67
     Floyd
    -0.66
     huh
    -0.65
    emonium
    -0.65
     Wonderful
    -0.65
    POSITIVE LOGITS
     emergencies
    0.86
     redund
    0.85
     backlog
    0.85
     licences
    0.83
     suppliers
    0.83
     biomark
    0.81
     corridors
    0.80
     ageing
    0.80
     migrants
    0.79
     processes
    0.77
    Act Density 0.629%

    No Known Activations