INDEX
    Explanations

    expressions and phrases indicating oddity or strangeness

    Negative Logits
    " referenties"    -0.77
    "شرين"            -0.67
    " بتاريخ"         -0.67
    "timbangkan"      -0.67
    "IsContent"       -0.66
    " ComVisible"     -0.66
    " coals"          -0.66
    " stomat"         -0.64
    "createCanvas"    -0.64
    "ciutto"          -0.62
    Positive Logits
    " strange"     1.26
    " Strange"     1.25
    "strange"      1.17
    " Weird"       1.16
    " weird"       1.16
    "Strange"      1.12
    " bizarre"     1.12
    "weird"        1.11
    " wierd"       1.10
    " étrange"     1.09
    Activation Density: 0.125%

    No Known Activations