    Explanations

    expressions of joy and satisfaction

    expressions of joy and pleasure

    Negative Logits
    Token             Logit
    "enhagen"         -0.83
    "©¶æ"             -0.82
    " mater"          -0.72
    "prison"          -0.70
    "eworld"          -0.68
    " restraining"    -0.68
    "road"            -0.68
    "ioxide"          -0.67
    "vernment"        -0.67
    "xia"             -0.66
    Positive Logits
    Token             Logit
    "fully"           0.97
    " delight"        0.91
    "joy"             0.91
    "ILY"             0.90
    " delighted"      0.87
    "iously"          0.76
    "ingly"           0.75
    "ously"           0.72
    "lance"           0.72
    " aston"          0.72
    Activation Density: 0.012%

    No Known Activations