INDEX
    Explanations

    contexts related to health conditions and their implications

    Negative Logits
    " Beide"        -0.74
    "Ambos"         -0.70
    " beide"        -0.69
    " Both"         -0.65
    "Šaltiniai"     -0.65
    " berdua"       -0.60
    "兩人"          -0.60
    " ffilmiau"     -0.59
    "Diweddarwch"   -0.58
    "Kedua"         -0.58
    Positive Logits
    " allemaal"     1.16
    " all"          1.03
    "etc"           0.92
    "all"           0.90
    " etc"          0.89
    " usw"          0.82
    " semuanya"     0.82
    [blank]         0.79
    "などなど"      0.78
    "すべて"        0.78
    Activation Density: 0.431%

    No Known Activations