INDEX
    Explanations

    distinct items in lists

    Negative Logits
    Token              Logit
    " جای"             0.80
    [token missing]    0.74
    " oman"            0.72
    "新たな"            0.72
    "cić"              0.70
    "さまざまな"        0.70
    [token missing]    0.68
    "гото"             0.68
    "ოს"               0.67
    " मिळाले"           0.66
    Positive Logits
    Token              Logit
    " except"          1.11
    "except"           0.96
    "Even"             0.94
    " despite"         0.92
    " ("               0.92
    " ($\"             0.91
    "Despite"          0.91
    " kecuali"         0.91
    " sauf"            0.90
    ","                0.88
    Activation Density: 0.676%

    No Known Activations