INDEX
    Explanations

    terms associated with intelligence or cleverness

    New Auto-Interp
    Negative Logits
     restrooms
    -0.67
    culus
    -0.67
     genital
    -0.66
    electric
    -0.64
    healthy
    -0.64
    inventory
    -0.63
    MN
    -0.62
     Cats
    -0.61
    alg
    -0.61
    culated
    -0.61
    POSITIVE LOGITS
    icht
    1.17
    entimes
    1.13
    iness
    0.86
    nown
    0.82
    rag
    0.80
    linger
    0.78
    lich
    0.77
    mann
    0.76
    wald
    0.76
    ueller
    0.74
    Act Density 0.007%

    No Known Activations