INDEX
    Explanations

    words associated with commands and directives

    New Auto-Interp
    Negative Logits
    illi
    -0.73
     Feder
    -0.67
    ritic
    -0.66
    tten
    -0.64
    ici
    -0.64
    IFA
    -0.64
     Garry
    -0.64
    illian
    -0.63
    rito
    -0.63
     CAM
    -0.63
    POSITIVE LOGITS
    up
    1.66
    ups
    1.45
    Up
    1.43
     up
    1.36
    UP
    1.33
     Up
    1.31
     ups
    1.22
     UP
    1.15
    upt
    1.06
     Ups
    1.00
    Act Density 0.036%

    No Known Activations