INDEX
    Explanations

    reference to legal principles and data analysis in various contexts

    New Auto-Interp
    Negative Logits
    <bos>
    -0.67
     morada
    -0.52
    -0.51
    uxxxx
    -0.50
    .*")]
    -0.47
    ########.
    -0.46
    vician
    -0.44
    cinta
    -0.43
     béné
    -0.43
     فاض
    -0.43
    POSITIVE LOGITS
     estekak
    0.86
     autorytatywna
    0.75
    ]='\
    0.67
     चीज़ों
    0.65
    ſelves
    0.64
    ſelf
    0.63
    ]--;
    0.63
     myſelf
    0.62
    RenderAtEndOf
    0.61
     iſt
    0.61
    Act Density 0.681%

    No Known Activations