INDEX
    Explanations

    phrases where someone is expressing an opinion or judgment

    descriptive language and terms relating to characterization and evaluation

    New Auto-Interp
    Negative Logits
     sqor
    -0.56
    existing
    -0.55
    ADS
    -0.53
    DAQ
    -0.52
    dinand
    -0.52
    ?'
    -0.52
    asaki
    -0.47
    MAS
    -0.47
    ablishment
    -0.47
    restling
    -0.46
    POSITIVE LOGITS
     "'
    2.19
     "â̦
    2.11
     "...
    2.06
     "[
    2.04
     "
    1.96
     "#
    1.78
     ".
    1.73
     "(
    1.72
     "@
    1.59
     ",
    1.56
    Act Density 0.755%

    No Known Activations