INDEX
    Explanations

    definite articles and their variations in different languages

    New Auto-Interp
    Negative Logits
     kautta
    -0.67
     regionales
    -0.66
     Esau
    -0.64
     Pflanze
    -0.64
     küche
    -0.63
     pernas
    -0.62
     tarko
    -0.61
     învă
    -0.60
     juridiques
    -0.60
     temporales
    -0.60
    POSITIVE LOGITS
    ')],
    0.98
    "):
    
    0.98
    '):
    
    0.92
    %")
    0.90
    ")));
    
    0.90
    %";
    0.87
    "])
    
    0.87
    )];
    
    0.85
    ]='\
    0.85
    >",
    
    0.84
    Act Density 0.022%

    No Known Activations