INDEX
    Explanations

    instances of the word "but" and its variations, indicating contrast or opposition in the text

    New Auto-Interp
    Negative Logits
    icy
    -0.16
     numel
    -0.15
    GRES
    -0.15
    ìĤ¬ì§Ģ
    -0.15
    itas
    -0.15
    ipple
    -0.14
     reserves
    -0.14
    ÑİÑĢ
    -0.14
    usaha
    -0.14
    hrad
    -0.13
    POSITIVE LOGITS
     lay
    0.15
    ayne
    0.15
    ANTE
    0.15
     ofType
    0.14
    ogui
    0.14
    ilege
    0.14
    @js
    0.14
     nor
    0.13
    \Array
    0.13
     ger
    0.13
    Act Density 0.150%

    No Known Activations