INDEX
    Explanations

    phrases that express personal opinions or subjective beliefs

    New Auto-Interp
    Negative Logits
     ath
    -0.15
     Deniz
    -0.14
     Kidd
    -0.13
    вал
    -0.13
     Shrine
    -0.13
    logger
    -0.13
     Shuttle
    -0.13
    fore
    -0.13
    士
    -0.13
    еÑĢж
    -0.13
    POSITIVE LOGITS
    pone
    0.17
    adol
    0.15
    .li
    0.14
    idia
    0.13
    zeug
    0.13
    ços
    0.13
    HG
    0.13
    zim
    0.13
    iesel
    0.13
    zer
    0.13
    Act Density 0.075%

    No Known Activations