INDEX
    Explanations

    proper nouns, particularly names of individuals and organizations

    New Auto-Interp
    Negative Logits
    ISM
    -0.77
     Alley
    -0.68
    ry
    -0.65
    ICA
    -0.65
     optics
    -0.65
    åĭ
    -0.62
     shrug
    -0.61
     tripod
    -0.61
     Galile
    -0.60
    tz
    -0.58
    POSITIVE LOGITS
    creen
    1.09
    terness
    1.07
    omething
    1.05
    hiba
    1.05
    ocial
    0.95
    paces
    0.92
    heed
    0.92
    ession
    0.91
    pace
    0.90
    andra
    0.89
    Act Density 0.117%

    No Known Activations