INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    _keyword
    -0.07
     pip
    -0.07
    .List
    -0.07
     Gh
    -0.06
     зн
    -0.06
    -0.06
     gelir
    -0.06
     Remark
    -0.06
     sentence
    -0.06
     District
    -0.06
    POSITIVE LOGITS
     Auto
    0.16
     auto
    0.16
    Auto
    0.13
    auto
    0.12
    _auto
    0.12
    	auto
    0.12
    _AUTO
    0.12
    aut
    0.12
     Aut
    0.11
    .aut
    0.11
    Act Density 0.047%

    No Known Activations