INDEX
    Explanations

    phrases indicating ownership or personalization

    New Auto-Interp
    Negative Logits
     himself
    -0.17
    ikal
    -0.17
     herself
    -0.17
    inta
    -0.16
    disk
    -0.15
    INARY
    -0.15
    unts
    -0.14
     bÃŃ
    -0.14
    ous
    -0.14
     Mond
    -0.14
    POSITIVE LOGITS
    udy
    0.16
    aly
    0.15
    url
    0.15
    éĶ
    0.15
    ERGY
    0.14
    sworth
    0.14
    version
    0.14
    URL
    0.14
     version
    0.14
    erner
    0.14
    Act Density 0.039%

    No Known Activations