INDEX
    Explanations

    the pronoun "he" in various contexts

    New Auto-Interp
    Negative Logits
    اÙĦ
    -0.67
     Crush
    -0.59
     Voltage
    -0.58
    Fight
    -0.57
     Splash
    -0.57
     Interest
    -0.56
     daytime
    -0.56
     Electronics
    -0.56
     Ammunition
    -0.55
     independently
    -0.55
    POSITIVE LOGITS
    eding
    1.29
    redit
    1.29
    eded
    1.29
    ctic
    1.26
    aping
    1.24
    aps
    1.22
    uristic
    1.19
    aving
    1.19
    arers
    1.17
    ctor
    1.15
    Act Density 0.114%

    No Known Activations