INDEX
    Explanations

    the word "Off" with the strongest activation levels

    words related to the concept of "off" or "offering."

    New Auto-Interp
    Negative Logits
    izabeth
    -0.70
    ridor
    -0.64
    growth
    -0.63
     dehydration
    -0.62
     skelet
    -0.62
    ascript
    -0.61
     federation
    -0.61
    omething
    -0.60
    enment
    -0.60
    ATHER
    -0.60
    POSITIVE LOGITS
     Off
    3.65
    Off
    2.44
     OFF
    2.42
    off
    1.88
    OFF
    1.80
     off
    1.56
    offs
    1.36
     Offer
    1.33
     Away
    1.27
     Down
    1.23
    Act Density 0.006%

    No Known Activations