INDEX
    Explanations

    adjectives and phrases related to various qualities or conditions

    adjectives and their modifiers that describe conditions or states

    New Auto-Interp
    Negative Logits
     };
    -0.86
     ',
    -0.86
     '.
    -0.83
     guiName
    -0.77
    .",
    -0.77
     ."
    -0.76
     ,"
    -0.74
     ];
    -0.71
    %.
    -0.71
     ",
    -0.70
    POSITIVE LOGITS
    )
    1.89
    -)
    1.62
    )-
    1.59
    ?)
    1.59
    )'
    1.59
    *)
    1.58
    !)
    1.50
    )"
    1.49
    )/
    1.47
    )*
    1.42
    Act Density 0.305%

    No Known Activations