INDEX
    Explanations

    phrases indicating permission or authorization

    New Auto-Interp
    Negative Logits
    mium
    -0.17
    odon
    -0.16
    noun
    -0.16
    InnerText
    -0.15
    imary
    -0.15
    rame
    -0.15
    /exp
    -0.14
    831
    -0.14
    .SelectedItems
    -0.14
    è¸
    -0.14
    POSITIVE LOGITS
    ADM
    0.16
    zn
    0.15
    aison
    0.15
    ee
    0.15
    eh
    0.15
    .wik
    0.14
    oppins
    0.14
     compar
    0.14
    ope
    0.14
    ilen
    0.13
    Act Density 0.041%

    No Known Activations