INDEX
    Explanations

    categorizing sentiment and emphasis

    New Auto-Interp
    Negative Logits
    juv
    0.45
     revend
    0.42
     unquestion
    0.39
     factoryName
    0.39
    ysł
    0.39
    чай
    0.38
    waitForIdleSync
    0.38
    insta
    0.38
     niez
    0.37
    PATCH
    0.37
    POSITIVE LOGITS
     ಚಿ
    0.43
     የወ
    0.40
    0.40
    Sche
    0.38
     Deity
    0.38
     Chaplin
    0.37
     Cot
    0.36
     ہندو
    0.35
     Burs
    0.35
    ക്കുറിച്ച
    0.35
    Act Density 0.005%

    No Known Activations