INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     |:
    -0.07
     NAMES
    -0.07
     catapult
    -0.06
     euth
    -0.06
     CHE
    -0.06
     tsp
    -0.06
    -employed
    -0.06
     grapes
    -0.06
     Southeast
    -0.06
    Это
    -0.06
    POSITIVE LOGITS
    jam
    0.07
    unc
    0.06
     turf
    0.06
     Position
    0.06
    Position
    0.06
     goats
    0.06
    king
    0.06
    horse
    0.06
    storms
    0.06
    орів
    0.06
    Act Density 0.011%

    No Known Activations