INDEX
    Explanations

    expressions of feelings of neglect or abandonment

    New Auto-Interp
    Negative Logits
     mini
    -0.15
    ogh
    -0.14
    dney
    -0.14
     Svens
    -0.14
    vida
    -0.14
    ÏĢοÏĤ
    -0.13
     moons
    -0.13
    ãĥĨãĥ«
    -0.13
    ansi
    -0.13
    墨
    -0.13
    POSITIVE LOGITS
     alone
    0.52
     Alone
    0.42
    alone
    0.40
    -alone
    0.36
     lone
    0.34
     AL
    0.33
     solo
    0.31
    _al
    0.30
    Al
    0.30
     along
    0.29
    Act Density 0.027%

    No Known Activations