INDEX
    Explanations

    words related to work and effort

    New Auto-Interp
    Negative Logits
     Ukrain
    -1.20
    wcs
    -0.90
     constitu
    -0.88
    iren
    -0.84
    anamo
    -0.80
    EStream
    -0.79
     champagne
    -0.77
     Flavoring
    -0.77
    ylon
    -0.75
     Bubble
    -0.72
    POSITIVE LOGITS
     ethic
    1.70
    flows
    1.57
    station
    1.55
    bench
    1.43
    horse
    1.42
    aday
    1.36
    manship
    1.36
    forces
    1.18
    tops
    1.15
    hops
    1.14
    Act Density 5.333%

    No Known Activations