INDEX
    Explanations

    entities with the word "new"

    occurrences of the word "new."

    New Auto-Interp
    Negative Logits
     suspic
    -0.76
    uated
    -0.60
    staking
    -0.59
     Pav
    -0.59
    uate
    -0.58
     Reconstruction
    -0.56
     bulls
    -0.55
    istic
    -0.54
    ific
    -0.53
     Bravo
    -0.53
    POSITIVE LOGITS
    riter
    1.40
    ritten
    1.27
    estern
    1.19
    ITNESS
    1.09
    alker
    1.05
    esome
    1.05
    ORD
    1.04
    isdom
    1.03
    olf
    1.02
    erker
    1.01
    Act Density 0.037%

    No Known Activations