INDEX
    Explanations

    proper nouns, particularly names and places associated with a cultural or historical context

    New Auto-Interp
    Negative Logits
    ipay
    -0.18
    MBProgressHUD
    -0.16
    TEL
    -0.14
     Op
    -0.14
    itez
    -0.14
    æķ·
    -0.13
    plash
    -0.13
    Bes
    -0.13
    ismu
    -0.13
     architecture
    -0.13
    POSITIVE LOGITS
    vak
    0.16
    ecess
    0.16
    emens
    0.15
    nergy
    0.15
    rok
    0.14
    mention
    0.14
    prot
    0.14
     Dere
    0.14
    cake
    0.14
     hare
    0.14
    Act Density 0.005%

    No Known Activations