INDEX
    Explanations

    mentions or discussions related to academic or technical content, possibly with a focus on specific concepts or methodologies

    New Auto-Interp
    Negative Logits
     sappi
    -1.28
     dises
    -1.25
     vogli
    -1.20
     mef
    -1.18
     solidar
    -1.17
     abbra
    -1.14
     abr
    -1.14
     „,
    -1.11
     gius
    -1.10
     ordina
    -1.09
    POSITIVE LOGITS
     by
    1.03
     into
    0.84
     with
    0.74
    by
    0.69
     according
    0.68
     to
    0.67
     through
    0.67
     przez
    0.66
     extensively
    0.65
     against
    0.64
    Act Density 0.374%

    No Known Activations