INDEX
    Explanations

    adverbs or adjectives ending in 'ly'

    adjectives and adverbs related to strength and support

    Negative Logits
    Token            Logit
    "çīĪ"            -0.94
    "illions"        -0.77
    "inea"           -0.77
    " Duchess"       -0.74
    "£ı"             -0.72
    "士"             -0.72
    "adelphia"       -0.70
    " Deaths"        -0.69
    "ा"              -0.68
    " Millions"      -0.67
    Positive Logits
    Token                  Logit
    " (>"                  0.74
    " ambition"            0.72
    " ambitions"           0.71
    "gradient"             0.69
    " defenses"            0.69
    "ellow"                0.68
    " directional"         0.68
    " differentiation"     0.68
    " tendency"            0.66
    " characterization"    0.65
    Activation Density: 0.297%

    No Known Activations