INDEX
    Explanations

    relationships and dynamics involving manipulation, support, and moral decisions

    New Auto-Interp
    Negative Logits
     pole
    -0.14
    assel
    -0.14
    enberg
    -0.14
     preferably
    -0.14
    елем
    -0.14
    actus
    -0.13
    ôt
    -0.13
    RefCount
    -0.13
    858
    -0.13
    posal
    -0.13
    POSITIVE LOGITS
     anyway
    1.10
    Anyway
    0.95
     anyways
    0.95
     Anyway
    0.92
     anyhow
    0.81
     toch
    0.44
     nonetheless
    0.42
     nevertheless
    0.40
    Nevertheless
    0.39
     Nonetheless
    0.38
    Act Density 0.918%

    No Known Activations