INDEX
    Explanations

    adjectives related to intensity or focus

    terms related to politicized and emotionally charged topics

    New Auto-Interp
    Negative Logits
    redits
    -0.82
    uther
    -0.82
    registered
    -0.80
    verified
    -0.76
    âĿ
    -0.75
    chwitz
    -0.75
    drug
    -0.75
     besides
    -0.74
    Specific
    -0.73
    Film
    -0.73
    POSITIVE LOGITS
     nature
    1.19
     confines
    0.94
     portion
    0.90
     assumption
    0.87
     notion
    0.87
     aspect
    0.85
     process
    0.83
     version
    0.82
     phase
    0.82
     tendency
    0.81
    Act Density 0.313%

    No Known Activations