INDEX
    Explanations

    phrases indicating responsibility or accountability

    phrases indicating responsibility or accountability

    New Auto-Interp
    Negative Logits
    Sport
    -0.83
    edin
    -0.75
    nets
    -0.75
    notations
    -0.71
    edu
    -0.70
    fair
    -0.70
    Tool
    -0.70
    Lab
    -0.68
    DragonMagazine
    -0.68
    Lim
    -0.67
    POSITIVE LOGITS
     maintaining
    1.06
     overseeing
    1.02
     regulating
    1.02
     safegu
    0.97
     keeping
    0.94
     ensuring
    0.94
     constructing
    0.91
     creating
    0.91
     upholding
    0.90
     preserving
    0.87
    Act Density 0.056%

    No Known Activations