INDEX
    Explanations

    information related to the German language

    Negative Logits
    Sharp         -0.81
    Moore         -0.77
    Matthew       -0.75
    Spoiler       -0.72
    Cold          -0.72
    Bloom         -0.72
    Luck          -0.71
    RAW           -0.70
    Background    -0.70
    Daddy         -0.69
    Positive Logits
     qui          1.09
     los          1.05
     mi           1.02
     si           1.00
     ni           0.99
     que          0.99
     é            0.98
     alle         0.97
     est          0.95
     Ã            0.95
    Activation Density: 2.321%

    No Known Activations