INDEX
    Explanations

    mentions of actions related to authority or exclusive rights

    occurrences of the substring "rog" in various contexts

    New Auto-Interp
    Negative Logits
    birth
    -0.70
    aved
    -0.68
     goodbye
    -0.65
     Fas
    -0.64
    ICAN
    -0.57
    rix
    -0.57
     Machina
    -0.56
    INESS
    -0.56
    sever
    -0.56
    fig
    -0.55
    POSITIVE LOGITS
    raphic
    1.32
    raphics
    1.22
    raph
    1.20
    allery
    1.07
    roup
    1.06
    aming
    0.99
    rams
    0.95
    atory
    0.95
    roups
    0.93
    ues
    0.92
    Act Density 0.045%

    No Known Activations