INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    .handle
    -0.07
    MER
    -0.07
    Merc
    -0.07
    othy
    -0.07
    .apache
    -0.07
     Manning
    -0.07
     Mob
    -0.07
    True
    -0.07
     mish
    -0.07
    Conn
    -0.07
    POSITIVE LOGITS
     légèrement
    0.11
     포함
    0.10
     optionally
    0.10
     commentary
    0.10
     قلي
    0.09
     punctuation
    0.09
     إثر
    0.09
     ligeramente
    0.09
    摘要
    0.09
     accompanying
    0.09
    Act Density 0.043%

    No Known Activations