INDEX
    Explanations

    concepts related to gaining something, whether it be knowledge, access, or an advantage

    New Auto-Interp
    Negative Logits
     Roberts
    -0.66
     Stevenson
    -0.64
     Schröder
    -0.63
     me
    -0.62
     Peters
    -0.60
     собой
    -0.58
     (
    -0.58
     Me
    -0.57
     I
    -0.57
     precios
    -0.57
    POSITIVE LOGITS
     GAIN
    1.62
     Gains
    1.50
     Gain
    1.48
    gain
    1.38
     gains
    1.37
     gain
    1.36
    Gain
    1.34
    GAIN
    1.28
     gained
    1.27
    gains
    1.25
    Act Density 0.053%

    No Known Activations