    Explanations

    quantifiers or articles that denote singular and plural forms

    Negative Logits
    Token           Logit
    ' suivantes'    -0.69
    ' armée'        -0.64
    'er'            -0.62
    ' Octobre'      -0.62
    ' restantes'    -0.62
    ' anul'         -0.61
    ' argint'       -0.59
    'Ch'            -0.59
    ' acido'        -0.58
    ' ahogy'        -0.58
    Positive Logits
    Token           Logit
    '{}",'          1.13
    ' a'            1.12
    '}")\n'         1.07
    'một'           1.06
    '"):\n'         1.05
    '^(@'           1.05
    ')";\n'         1.04
    '"){\n'         1.04
    ' ")\n'         1.02
    '!")\n'         1.01
    Act Density 0.011%

    No Known Activations