INDEX
    Explanations

    words indicating visibility or clarity

    terms indicating visibility or clarity of certain situations or conditions

    New Auto-Interp
    Negative Logits
    uden
    -0.79
    nan
    -0.65
    bey
    -0.64
    pping
    -0.63
    ggies
    -0.62
    miah
    -0.62
     tightly
    -0.61
    uese
    -0.61
    akin
    -0.61
    berman
    -0.60
    POSITIVE LOGITS
    iary
    1.24
    aneously
    0.89
     contradictions
    0.88
    ibility
    0.88
    ible
    0.86
     Signs
    0.78
     discrepancies
    0.75
    ibly
    0.74
    iator
    0.74
    URE
    0.74
    Act Density 0.032%

    No Known Activations