INDEX
    Explanations

    expressions of humor and irony

    New Auto-Interp
    Negative Logits
    partials
    -0.16
    abox
    -0.16
    engan
    -0.16
    rib
    -0.15
    arrants
    -0.14
    ovic
    -0.14
    lag
    -0.13
    oper
    -0.13
    umi
    -0.13
    ISR
    -0.13
    POSITIVE LOGITS
    .Networking
    0.16
     Provid
    0.16
     Creator
    0.15
    Creator
    0.14
     Yah
    0.14
     Wort
    0.14
    oteca
    0.13
     ÐļÑĢа
    0.13
    Wa
    0.13
     Brewery
    0.13
    Act Density 0.099%

    No Known Activations