INDEX
    Explanations

    references to violations of laws or rules

    New Auto-Interp
    Negative Logits
    onto
    -0.18
    eller
    -0.18
    rose
    -0.16
    ellers
    -0.16
    coil
    -0.14
     ÑĢÑĥкаÑħ
    -0.14
    ilog
    -0.14
    els
    -0.14
    iky
    -0.14
    erals
    -0.14
    POSITIVE LOGITS
    icorn
    0.16
    oding
    0.15
    omik
    0.14
    šek
    0.14
    MimeType
    0.14
    dül
    0.14
    èĢħ
    0.14
    ersed
    0.14
    umont
    0.13
     fundamentals
    0.13
    Act Density 0.031%

    No Known Activations