INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ResponseType
    0.43
     Fundraising
    0.42
    вершена
    0.42
     നടപ
    0.41
    seqNum
    0.40
     dựng
    0.37
    Markets
    0.37
     Hiring
    0.36
     mengeluarkan
    0.36
    Der
    0.35
    POSITIVE LOGITS
     penetrates
    0.98
     flowing
    0.98
     penetrate
    0.95
     flowed
    0.92
     flows
    0.90
     flow
    0.89
     travels
    0.86
     enters
    0.85
    进入
    0.82
     trapped
    0.77
    Act Density 0.076%

    No Known Activations