INDEX
    Explanations

    names of physical places or organizations

    words related to proof or evidence

    New Auto-Interp
    Negative Logits
    geist
    -0.81
    lasses
    -0.75
    mare
    -0.70
     count
    -0.67
     Eclipse
    -0.66
     Rite
    -0.66
    Reloaded
    -0.65
    Pie
    -0.65
    DEN
    -0.62
    mite
    -0.61
    POSITIVE LOGITS
    idence
    1.34
    idential
    1.20
    idently
    1.16
    isions
    1.15
    idences
    1.12
    iders
    1.02
    ident
    0.98
    esses
    0.96
    irus
    0.95
    identally
    0.93
    Act Density 0.014%

    No Known Activations