INDEX
    Explanations

    during, among, within, via

    Negative Logits
     first      -1.98
     for        -1.88
    其余          -1.77
     gesund     -1.68
     only       -1.63
                -1.60
     by         -1.56
     where      -1.56
     in         -1.55
     الأولى     -1.55
    Positive Logits
     flera      1.89
     nästan     1.78
    k           1.77
                1.77
                1.77
     naturligt  1.76
    ignment     1.76
    -\\         1.75
     möjlighet  1.72
    oting       1.68
    Act Density 0.079%

    No Known Activations