INDEX
    Explanations

    references to names or entities

    Negative Logits
    merce                -0.66
    ilitating            -0.66
    ibrary               -0.65
    çīĪ                  -0.64
    roman                -0.63
    enegger              -0.61
    natureconservancy    -0.60
    estate               -0.60
    \/\/                 -0.59
    ilitation            -0.59
    Positive Logits
    Shift                0.71
    ister                0.68
    ly                   0.68
    leness               0.64
    lev                  0.64
    essing               0.63
    ochond               0.61
    anni                 0.61
    arent                0.60
     Amount              0.59
    Activation Density: 0.249%

    No Known Activations