INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    Your
    0.74
     Investment
    0.74
     কর্ম
    0.71
    opin
    0.70
    持续
    0.70
     your
    0.68
     Consciousness
    0.68
    σ
    0.67
    Healthcare
    0.67
     invested
    0.67
    POSITIVE LOGITS
     criterion
    1.16
     dracon
    1.11
     критери
    1.04
     critères
    0.97
     loophole
    0.96
     discriminatory
    0.96
     criteria
    0.93
     incentivize
    0.90
     requisito
    0.90
     kriter
    0.90
    Act Density 0.078%

    No Known Activations