INDEX
    Explanations

    the article "a" and its variations, indicating a focus on singular nouns

    New Auto-Interp
    Negative Logits
     صوتيه
    -0.74
     للمعارف
    -0.73
    RegressionTest
    -0.71
     kasarigan
    -0.71
    Hentet
    -0.69
     ujednoznacz
    -0.66
    SEGUIR
    -0.63
     Chwiliwch
    -0.63
    المكان
    -0.61
    GOTREF
    -0.60
    POSITIVE LOGITS
    with
    0.94
     WITH
    0.87
    With
    0.87
     With
    0.85
     with
    0.81
    WITH
    0.71
     avec
    0.69
     עם
    0.67
     Avec
    0.66
     با
    0.63
    Act Density 0.047%

    No Known Activations