INDEX
    Explanations

    words indicating alternatives or substitutions

    Negative Logits
    .opensource    -0.15
     ActiveForm    -0.15
    å±Ĭ    -0.15
    å±Ĩ    -0.15
    @dynamic    -0.14
    âķIJ    -0.14
     Marino    -0.14
    à¥ģà¤Ł    -0.14
     cin    -0.14
    OMEM    -0.14
    Positive Logits
    vez    0.16
     usual    0.15
    instead    0.15
    ãĥ¼ãĤ¯    0.14
    742    0.14
     instead    0.14
    afen    0.14
    elerik    0.14
    ='".    0.14
    å®ľ    0.14
    Act Density 0.030%

    No Known Activations