INDEX
    Explanations

    disagreement and opinions

    New Auto-Interp
    Negative Logits
     in
    -2.75
     by
    -2.63
     you
    -2.55
     of
    -2.34
     to
    -2.31
     with
    -2.25
     as
    -1.98
     especially
    -1.94
     including
    -1.89
     other
    -1.87
    POSITIVE LOGITS
     étan
    1.66
    šia
    1.50
     exemples
    1.48
    anyuan
    1.44
    他們
    1.43
    Angebot
    1.40
     réfrig
    1.39
     Paramètres
    1.39
    1.37
    .=
    1.36
    Act Density 0.129%

    No Known Activations