INDEX
    Explanations

    substitute for professional advice

    New Auto-Interp
    Negative Logits
     only
    1.11
     unambiguously
    0.98
    only
    0.98
     ONLY
    0.97
    ONLY
    0.95
     Only
    0.95
     ostensibly
    0.92
     Both
    0.87
     synchronously
    0.86
     both
    0.85
    POSITIVE LOGITS
     nor
    0.85
    Nor
    0.81
    Mid
    0.77
    0.77
    Глав
    0.75
     Nor
    0.75
     গিয়ে
    0.74
    ဘူး
    0.74
     গিয়ে
    0.72
    nor
    0.72
    Act Density 0.030%

    No Known Activations