INDEX
    Explanations

    prepositions and conjunctions

    New Auto-Interp
    Negative Logits
     faſt
    -0.55
     pleaſure
    -0.51
     enfans
    -0.47
     itſelf
    -0.47
     inſ
    -0.46
     preſent
    -0.45
     raiſ
    -0.45
    ſelves
    -0.45
     tranſ
    -0.44
     ſy
    -0.43
    POSITIVE LOGITS
     من
    1.71
    من
    1.09
     FROM
    0.91
    FROM
    0.84
     From
    0.84
     ومن
    0.83
     dari
    0.83
     from
    0.82
     מן
    0.82
    From
    0.81
    Act Density 0.000%

    No Known Activations