INDEX
    Explanations

    expressions related to goals or ambitions

    New Auto-Interp
    Negative Logits
    an
    -0.15
    bro
    -0.15
    edom
    -0.15
    ials
    -0.14
     hammer
    -0.14
    odore
    -0.14
    bil
    -0.14
    æĺł
    -0.14
    uous
    -0.14
    ught
    -0.14
    POSITIVE LOGITS
    lessly
    0.28
     Aim
    0.18
    tır
    0.18
    erais
    0.17
    /target
    0.16
    yr
    0.16
    lexport
    0.16
    yro
    0.16
    azon
    0.16
    ldr
    0.16
    Act Density 0.013%

    No Known Activations