INDEX
    Explanations

    occurrences of the phrase "for you" and its variations

    Negative Logits
    Token          Logit
    " simp"        -0.17
    "ilar"         -0.16
    "еÑĢп"         -0.15
    "atters"       -0.15
    "bles"         -0.14
    " exe"         -0.14
    "æ®"           -0.14
    "ibly"         -0.14
    "ihil"         -0.14
    " оÑģÑĤан"     -0.14

    Positive Logits
    Token          Logit
    "nl"           0.16
    "agate"        0.15
    "orang"        0.15
    "zdy"          0.15
    "iendo"        0.15
    "gone"         0.14
    " Bail"        0.14
    "ÑĤик"         0.14
    " Paladin"     0.14
    "ARED"         0.14

    Activation Density: 0.033%

    No Known Activations