INDEX
    Explanations

    phrases indicating continuation, endurance, or lack of decline in various phenomena

    phrases indicating the persistence or continuity of trends and conditions

    New Auto-Interp
    Negative Logits
    adelphia
    -0.77
    chens
    -0.67
    omers
    -0.67
     Walters
    -0.66
    eenth
    -0.65
     Centers
    -0.64
    chool
    -0.63
    quet
    -0.62
    ttes
    -0.62
     ILCS
    -0.62
    POSITIVE LOGITS
     truce
    0.82
     decay
    0.77
     progress
    0.75
     improvement
    0.74
     aggression
    0.74
     differentiation
    0.73
     bias
    0.72
     validation
    0.72
    improve
    0.72
     signal
    0.71
    Act Density 0.069%

    No Known Activations