INDEX
    Explanations

    what something depends on

    Negative Logits
    不允许           0.73
                    0.69
    積極的に         0.69
     প্রস্তাবে       0.67
     الأش           0.66
    ിക്കുകയും       0.64
    哥哥             0.64
    atrième         0.63
     mencegah       0.63
     Proposal       0.62
    Positive Logits
     depends        4.97
     depend         4.83
     depending      4.80
    depends         4.36
    depending       4.33
     dependent      4.27
     Depends        4.25
     depended       4.25
     depende        4.23
    Depends         4.16
    Activation Density: 2.451%

    No Known Activations