INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    .Tr
    -0.08
    -0.07
     хвор
    -0.07
    ITE
    -0.06
     İslâm
    -0.06
    _RANDOM
    -0.06
    ंख
    -0.06
    大学
    -0.06
    "F
    -0.06
    -0.06
    POSITIVE LOGITS
     Bow
    0.17
     bow
    0.16
    bow
    0.14
     bows
    0.14
    Bow
    0.12
     bowed
    0.12
     Bowman
    0.10
     Bowie
    0.09
     Bowen
    0.08
    OW
    0.08
    Act Density 0.003%

    No Known Activations