INDEX
    Explanations

    specific symbols and characters in a text

    occurrences of the word "forbid" and its variations, alongside specific symbols and abbreviations

    Negative Logits
    Token       Logit
    uations     -0.75
    urally      -0.74
    uation      -0.73
    orem        -0.72
    eur         -0.71
    emouth      -0.70
    heed        -0.68
    owan        -0.66
    ciating     -0.65
    ements      -0.65
    Positive Logits
    Token       Logit
    ļéĨĴ        0.93
    bie         0.87
    atri        0.87
    earance     0.87
    stract      0.84
    é¾įåĸļ士    0.74
    bian        0.74
    icity       0.72
    edia        0.71
    Reply       0.71
    Activation Density: 0.051%

    No Known Activations