INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ")],
    0.75
     هنگام
    0.74
     obtained
    0.73
    0.72
     tijdens
    0.70
     Verified
    0.69
     DURING
    0.68
    が入
    0.68
    ລິ
    0.67
     during
    0.67
    POSITIVE LOGITS
    End
    1.24
     End
    1.20
     end
    1.15
    終わり
    1.14
    结束
    1.10
    end
    1.04
     era
    1.04
    END
    1.02
     END
    1.02
     Ended
    0.95
    Act Density 0.066%

    No Known Activations