INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     বিদ্ব
    0.36
     multiplicative
    0.36
    iob
    0.35
     panic
    0.35
    िएशन
    0.35
     SRL
    0.35
     hatred
    0.35
     দৃঢ়
    0.34
     verifiable
    0.34
     fearful
    0.34
    POSITIVE LOGITS
     relaxing
    1.30
     relax
    1.23
    Relax
    1.15
    relax
    1.14
     relaxation
    1.11
     lounging
    1.09
     Relax
    1.07
     relaxes
    1.07
     relaxed
    1.01
    放松
    1.00
    Act Density 0.047%

    No Known Activations