INDEX
    Explanations

    names or titles referring to individuals or characters, particularly in a playful or provocative context

    New Auto-Interp
    Negative Logits
    imet
    -0.14
    BOSE
    -0.14
    emet
    -0.14
    зÑĭ
    -0.14
    ç¸
    -0.14
     councils
    -0.14
    sembled
    -0.14
    åĴ¨
    -0.13
    ALSE
    -0.13
    chio
    -0.13
    POSITIVE LOGITS
     jas
    0.20
    Cum
    0.18
     Cum
    0.18
    69
    0.17
     hot
    0.16
     Lex
    0.16
    232
    0.16
    cum
    0.15
    Hot
    0.15
    .hot
    0.14
    Act Density 0.032%

    No Known Activations