INDEX
    Explanations

    conversational elements related to personal experiences or problems

    New Auto-Interp
    Negative Logits
     pinulongan
    -0.51
    hyrchwyd
    -0.47
    IUrlHelper
    -0.46
    -0.46
     Мексичка
    -0.44
     âmes
    -0.44
    MethodManager
    -0.43
    匿名使用者
    -0.43
    Hentet
    -0.43
    ագրություններ
    -0.43
    POSITIVE LOGITS
    IVEREF
    0.42
     ComVisible
    0.40
    endpush
    0.40
    Exclu
    0.39
     trag
    0.39
     sekali
    0.39
    PYX
    0.39
     Paglinawan
    0.38
    phat
    0.37
    glow
    0.37
    Act Density 0.328%

    No Known Activations