INDEX
    Explanations

    instances of being compelled or obligated to act in a certain way

    New Auto-Interp
    Negative Logits
    dt
    -0.17
    ven
    -0.16
    nde
    -0.15
    анÑģи
    -0.14
    kova
    -0.14
    ç³»
    -0.14
    ãģŁãĤī
    -0.14
     Freed
    -0.14
    reo
    -0.14
    ssel
    -0.14
    POSITIVE LOGITS
     into
    0.25
     onto
    0.23
    into
    0.22
    forced
    0.21
     Forced
    0.20
     forced
    0.19
    onto
    0.19
    _into
    0.18
    ache
    0.18
    berger
    0.17
    Act Density 0.043%

    No Known Activations