INDEX
    Explanations

    references to communication and information sharing

    Text after apostrophes

    apostrophes and quotation marks

    New Auto-Interp
    Negative Logits
    kloped
    -0.74
    OGND
    -0.74
    IndentedString
    -0.69
    ScopeManager
    -0.68
     فريبيس
    -0.63
     invokingState
    -0.63
     estekak
    -0.63
    :✨
    -0.60
     kaarangay
    -0.60
    ArrowToggle
    -0.59
    POSITIVE LOGITS
    0.57
    apos
    0.49
    fs
    0.48
    ństwa
    0.47
    principalColumn
    0.47
    \'
    0.47
    rsquo
    0.47
     getItemId
    0.46
    0.44
     apos
    0.44
    Act Density 0.098%

    No Known Activations