INDEX
    Explanations

    thoughts, beliefs, and opinions indicated by phrases like "in my mind", "no doubt in my mind", and "in our minds"

    New Auto-Interp
    Negative Logits
     unequal
    -0.75
    ammy
    -0.70
     Virt
    -0.66
    ModLoader
    -0.63
     bicycles
    -0.63
     pestic
    -0.61
     Abuse
    -0.61
    byn
    -0.60
    cific
    -0.60
    Iv
    -0.60
    POSITIVE LOGITS
    cells
    0.87
    ulsion
    0.79
    ings
    0.76
    swer
    0.74
    gdala
    0.73
    share
    0.72
    inations
    0.71
    sts
    0.71
    grass
    0.70
    terday
    0.69
    Act Density 0.063%

    No Known Activations