INDEX
    Explanations

    founding or destruction

    New Auto-Interp
    Negative Logits
    established
    -1.23
     established
    -1.17
     Established
    -1.13
     destruction
    -1.04
    Established
    -0.99
    CloseOperation
    -0.88
    destruction
    -0.88
     destrucción
    -0.86
     étab
    -0.85
    establishment
    -0.84
    POSITIVE LOGITS
    ly
    0.66
    ally
    0.63
    man
    0.63
    ry
    0.61
    s
    0.59
    men
    0.58
    tas
    0.58
     ra
    0.57
    y
    0.56
    ate
    0.56
    Act Density 0.238%

    No Known Activations