INDEX
    Explanations

    individuals and their roles or actions

    New Auto-Interp
    Negative Logits
    ixed
    -0.16
    ile
    -0.14
     nearby
    -0.14
    ã썿ĢĿãģĨ
    -0.14
    ici
    -0.14
    emark
    -0.14
     echang
    -0.13
    eded
    -0.13
    ál
    -0.13
    out
    -0.13
    POSITIVE LOGITS
     last
    0.25
     previously
    0.24
     earlier
    0.24
     yesterday
    0.21
    Earlier
    0.20
     hadn
    0.20
     originally
    0.20
     first
    0.20
     had
    0.19
     Earlier
    0.19
    Act Density 0.214%

    No Known Activations