INDEX
    Explanations

    names of political figures and locations

    the letter "s" in various contexts

    New Auto-Interp
    Negative Logits
    Shock
    -0.66
    yz
    -0.66
    sie
    -0.64
    yk
    -0.62
    CVE
    -0.61
    Els
    -0.61
     Osc
    -0.60
    PN
    -0.59
    ov
    -0.59
    Alert
    -0.58
    POSITIVE LOGITS
    etheless
    0.98
    wered
    0.92
    pecially
    0.82
     grandson
    0.80
    ensibly
    0.78
    outhern
    0.77
     successor
    0.77
     oldest
    0.76
    ources
    0.75
    uddenly
    0.75
    Act Density 0.184%

    No Known Activations