INDEX
    Explanations

    negative terms or critiques in the context of a review or commentary

    negative descriptors related to quality or performance

    Negative Logits
    "awei"          -0.90
    " Breach"       -0.79
    " Moor"         -0.78
    " Harrington"   -0.71
    "utterstock"    -0.67
    " Buchanan"     -0.66
    "versive"       -0.66
    " Farrell"      -0.64
    "rity"          -0.64
    "reatment"      -0.64

    Positive Logits
    "bian"          0.88
    "itsch"         0.85
    "icans"         0.79
    "daq"           0.78
    " sclerosis"    0.77
    "Magikarp"      0.76
    "gebra"         0.73
    "Args"          0.72
    "eless"         0.72
    "estyle"        0.71

    Act Density 0.020%

    No Known Activations