INDEX
    Explanations

    occurrences of the word "this."

    Negative Logits
     кӀ
    -0.43
    ****/
    -0.38
    𝟹
    -0.36
    ۜ
    -0.36
    slot
    -0.36
    $_.
    -0.35
    /*!
    
    -0.35
    /*!
    -0.35
     slot
    -0.35
    ********/
    -0.34
    Positive Logits
    parsedMessage
    0.62
    SharedDtor
    0.61
     חיצוניים
    0.60
     unggul
    0.50
     utafitiHapana
    0.48
     Italij
    0.48
     nonUne
    0.46
    Dichloropropane
    0.46
     tapaht
    0.45
     honom
    0.44
    Activation Density 0.007%

    No Known Activations