INDEX
    Explanations

    adjectives related to fairness or justifiability

    occurrences of the word "just" and its variations in different contexts

    New Auto-Interp
    Negative Logits
    xual
    -0.63
    pora
    -0.62
     Licensed
    -0.62
    2020
    -0.61
     challeng
    -0.61
    ccording
    -0.60
    antage
    -0.60
     Palestin
    -0.60
     Archdemon
    -0.58
     necks
    -0.58
    POSITIVE LOGITS
    ifiable
    1.53
    ifications
    1.32
    ified
    1.14
    ification
    1.05
    IFIED
    0.96
    ifiers
    0.96
    if
    0.95
    ifying
    0.94
    icia
    0.90
    ifier
    0.88
    Act Density 0.093%

    No Known Activations