INDEX
    Explanations

    the word "changes" at different levels of magnitude

    references to modifications or alterations in policies or regulations

    New Auto-Interp
    Negative Logits
    ographies
    -0.72
     AFB
    -0.70
    Äĩ
    -0.69
     DRAGON
    -0.67
    ZE
    -0.65
    ãĥ¥
    -0.65
    amina
    -0.65
    Write
    -0.65
    ILLE
    -0.65
    zees
    -0.61
    POSITIVE LOGITS
    uits
    0.99
     wrought
    0.98
     effected
    0.89
    hops
    0.87
    ettings
    0.83
    undown
    0.81
    hift
    0.79
    oodoo
    0.78
    hooting
    0.76
    oldown
    0.75
    Act Density 0.039%

    No Known Activations