INDEX
    Explanations

    articles/prepositions

    New Auto-Interp
    Negative Logits
     अनेक
    -0.09
     बह
    -0.08
     altos
    -0.08
    -0.08
     निम
    -0.08
    stantial
    -0.07
    .rob
    -0.07
     निर्ण
    -0.07
     ganhos
    -0.07
     premios
    -0.07
    POSITIVE LOGITS
    ”,
    0.09
    replace
    0.09
    ctp
    0.08
     Boll
    0.08
     Wenn
    0.08
    ?”,
    0.08
    ","+
    0.08
    \",\"
    0.08
    ?”.
    0.08
     ?");↵
    0.08
    Act Density 0.057%

    No Known Activations