INDEX
    Explanations

    articles and determiners in the text

    New Auto-Interp
    Negative Logits
     gouvernements
    -0.74
     sentiers
    -0.71
     pouvoirs
    -0.70
     épaules
    -0.69
     défis
    -0.68
     karta
    -0.67
     dítě
    -0.66
     gouttes
    -0.64
     vœux
    -0.64
     oreilles
    -0.63
    POSITIVE LOGITS
    Những
    1.02
    %")
    0.97
    mêmes
    0.97
    wsze
    0.96
    Οι
    0.96
     theses
    0.95
     οι
    0.94
     nakalista
    0.94
    }")
    
    0.93
    )";
    
    0.92
    Act Density 0.110%

    No Known Activations