INDEX
    Explanations

    references to occurrences in different locations or environments

    occurrences of the word "in" within various contexts

    Negative Logits
    " accordingly"  -0.79
    "something"     -0.76
    " likewise"     -0.74
    " instead"      -0.72
    " even"         -0.71
    "almost"        -0.70
    "10000"         -0.70
    "even"          -0.70
    " almost"       -0.70
    " whenever"     -0.70
    Positive Logits
    " sexes"                   1.09
    " genders"                 0.87
    " sender"                  0.78
    " physical"                0.75
    " academia"                0.73
    "BuyableInstoreAndOnline"  0.73
    " literal"                 0.69
    " textual"                 0.68
    " verbal"                  0.68
    " hardware"                0.67
    Activation Density: 0.169%

    No Known Activations