INDEX
    Explanations

    mentions of age

    New Auto-Interp
    Negative Logits
     comp
    -0.66
     summ
    -0.64
     artic
    -0.63
     tant
    -0.62
     condu
    -0.62
     conspic
    -0.62
     coupled
    -0.61
     Scient
    -0.61
     spir
    -0.60
     communication
    -0.60
    POSITIVE LOGITS
    old
    4.39
    olds
    3.26
    OLD
    2.60
    older
    2.56
    olding
    2.27
    olded
    1.98
    Old
    1.92
    ould
    1.65
     old
    1.51
     olds
    1.37
    Act Density 0.033%

    No Known Activations