INDEX
    Explanations

    linguistics

    Negative Logits
    Token             Logit
    "780"             -0.07
    "Storyboard"      -0.07
    " fotografia"     -0.07
    " Romance"        -0.07
    "PIRE"            -0.07
    "zać"             -0.07
    " accelerating"   -0.07
    [missing]         -0.07
    " ferramentas"    -0.07
    " Priority"       -0.07
    Positive Logits
    Token             Logit
    "impin"           0.08
    "ourced"          0.08
    "igned"           0.08
    " Bahn"           0.07
    " صالح"           0.07
    " محور"           0.07
    " Heck"           0.07
    "bahn"            0.07
    " Noch"           0.07
    " laug"           0.07
    Act Density 0.000%

    No Known Activations