    Explanations

    sentences relating to mythical characters or entities

    Followed by non-English words/characters

    kalo tak, tetapi, だけど

    Negative Logits
    Token              Logit
    "NUMX"             -0.65
    "UnsafeEnabled"    -0.60
    "__":"             -0.58
    "这让"             -0.58
    " ?',"             -0.57
    "])):"             -0.56
    "Obrigada"         -0.56
    "NOPQRST"          -0.56
    " XNUMX"           -0.56
    "pportun"          -0.55
    Positive Logits
    Token              Logit
    "Portály"          0.67
    " already"         0.63
    " still"           0.63
    " not"             0.60
    " sometimes"       0.58
    "tagHelper"        0.57
    " Already"         0.57
    " Afraid"          0.53
    (not captured)     0.53
    " مفص"             0.52
    Activation Density: 0.122%

    No Known Activations