INDEX
    Explanations

    concepts that arise or reside

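    Negative Logits
    Token           Logit
    " whereas"      0.50
    " Basically"    0.43
    " అయితే"        0.43   (Telugu: "however")
    " instead"      0.41
    " Although"     0.41
    " basically"    0.41
    "沒有"          0.41   (Chinese: "not have")
    " although"     0.40
    " Unable"       0.40
    " Whereas"      0.40

    Positive Logits
    Token           Logit
    " comes"        1.04
    " arises"       0.99
    " lies"         0.93
    " emerges"      0.89
    "comes"         0.87
    " lur"          0.86
    " resides"      0.84
    " rests"        0.83
    " Comes"        0.81
    " возникает"    0.77   (Russian: "arises")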
    Act Density 0.013%

    No Known Activations