INDEX
    Explanations

    long verbs related to evaluating, analyzing, or causing risks or potential harm

    words related to risk, scrutiny, and political concepts

    New Auto-Interp
    Negative Logits
    REE
    -0.65
    auri
    -0.61
    erity
    -0.59
    ree
    -0.59
     Wad
    -0.57
     Anger
    -0.56
    Notable
    -0.55
    tags
    -0.55
    ritz
    -0.55
    elf
    -0.55
    POSITIVE LOGITS
    izing
    2.82
    ized
    2.82
    ization
    2.65
    izes
    2.61
    ize
    2.61
    ised
    2.56
    ising
    2.45
    izers
    2.44
    izations
    2.40
    isation
    2.35
    Act Density 0.198%

    No Known Activations