INDEX
    Explanations

    common function words and prepositions in the text

    Negative Logits
    ograd
    -0.17
    using
    -0.14
    وث
    -0.14
     kra
    -0.14
     الأن
    -0.14
    elves
    -0.13
    obj
    -0.13
    roje
    -0.13
    الات
    -0.13
    amina
    -0.12
    Positive Logits
     hap
    0.16
    uteur
    0.15
     Wal
    0.15
     Steven
    0.15
    zon
    0.14
    ihan
    0.14
    OfDay
    0.14
    anager
    0.14
    eline
    0.14
    baugh
    0.14
    Act Density 0.069%

    No Known Activations