INDEX
    Explanations

    phrases related to official orders or proclamations

    New Auto-Interp
    Negative Logits
    ortmund
    -0.81
    doms
    -0.75
    regate
    -0.74
    idences
    -0.72
     Flavoring
    -0.71
    çīĪ
    -0.70
     Increases
    -0.70
    arms
    -0.69
     VIDEOS
    -0.68
    ipeg
    -0.67
    POSITIVE LOGITS
     supposed
    1.00
     considered
    0.91
     going
    0.89
     aware
    0.89
     gonna
    0.87
     ready
    0.86
     barred
    0.85
     concerned
    0.83
     ashamed
    0.82
     able
    0.82
    Act Density 2.014%

    No Known Activations