INDEX
    Explanations

    google edu/education links

    New Auto-Interp
    Negative Logits
     Verständ
    0.60
     résult
    0.59
     diminution
    0.58
     bioavailability
    0.57
    带动
    0.56
    RAMM
    0.55
     factorización
    0.55
     że
    0.55
     বন্ধ
    0.55
     cytotoxicity
    0.55
    POSITIVE LOGITS
     something
    0.59
     anything
    0.58
     any
    0.57
    something
    0.57
    anything
    0.55
     ANYTHING
    0.55
     ಏನು
    0.54
     ஏதாவது
    0.53
     everything
    0.53
    何か
    0.52
    Act Density 0.001%

    No Known Activations