INDEX
    Explanations

    mentions of specific authors or works in a review context

    Negative Logits
    " of"      -0.53
    "."        -0.52
    "<eos>"    -0.46
    " in"      -0.46
    " at"      -0.45
    "-"        -0.45
    " des"     -0.45
    "del"      -0.42
    "↵↵"       -0.41
    " los"     -0.41
    Positive Logits
    " nahilalakip"        1.18
    " CreateTagHelper"    1.16
    " للمعارف"            1.12
    " estekak"            1.11
    " gynhyrchwyd"        1.09
    " فريبيس"             1.08
    "expandindo"          1.07
    "saraba"              1.03
    " transfieras"        1.03
    " كومونز"             1.02
    Act Density 0.220%

    No Known Activations