INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    er
    -0.77
    Oise
    -0.67
     pursuant
    -0.63
    </b>
    -0.62
     Crowe
    -0.61
    <b>
    -0.60
    </i>
    -0.60
    arean
    -0.59
    ه
    -0.59
    ра
    -0.59
    POSITIVE LOGITS
     most
    1.55
    most
    1.45
    MOST
    1.28
     MOST
    1.27
    Most
    1.25
     Most
    1.17
     meeste
    1.13
     fleste
    1.11
     flesta
    1.09
     meisten
    1.09
    Act Density 0.100%

    No Known Activations