INDEX
    Explanations

    activities or descriptions related to coding and programming

    New Auto-Interp
    Negative Logits
     unison
    -0.61
    their
    -0.55
     prevalence
    -0.53
     aggregate
    -0.51
     similarity
    -0.51
     ASP
    -0.51
     Variant
    -0.51
    vari
    -0.51
    iciency
    -0.50
     defic
    -0.50
    POSITIVE LOGITS
     himself
    1.57
     his
    1.15
    His
    1.11
     herself
    1.09
     Himself
    1.07
     he
    1.05
    his
    1.04
    He
    0.97
     His
    0.90
     He
    0.84
    Act Density 0.887%

    No Known Activations