INDEX
    Explanations

    proper nouns and names in the text

    New Auto-Interp
    Negative Logits
    mam
    -0.16
    Łèĥ½
    -0.15
    že
    -0.15
    apg
    -0.15
    anton
    -0.15
    TEGR
    -0.14
     authorize
    -0.14
     authorized
    -0.14
    cul
    -0.14
    weeney
    -0.14
    POSITIVE LOGITS
    åºĦ
    0.14
     Force
    0.14
    zer
    0.14
    ytt
    0.14
    rol
    0.14
    ilet
    0.13
    /by
    0.13
    ramework
    0.13
     ti
    0.13
     expend
    0.13
    Act Density 0.050%

    No Known Activations