INDEX
    Explanations

    still under development

    New Auto-Interp
    Negative Logits
     already
    0.91
     Already
    0.88
     כבר
    0.80
    Already
    0.80
     уже
    0.77
     вже
    0.75
    already
    0.73
     allerede
    0.73
     ইতিমধ্যেই
    0.71
     sudah
    0.69
    POSITIVE LOGITS
     remains
    0.79
     retains
    0.78
     остается
    0.75
     остаются
    0.70
     vestiges
    0.65
     남아
    0.65
     remain
    0.64
     plenty
    0.63
     fundamentally
    0.63
     remained
    0.61
    Act Density 0.082%

    No Known Activations