INDEX
    Explanations

    intensifiers or modifiers, specifically the word "very."

    New Auto-Interp
    Negative Logits
    IsContent
    -0.79
     проще
    -0.69
     Akhtar
    -0.68
    id
    -0.67
     ombre
    -0.67
     Jackman
    -0.67
    ded
    -0.67
     forthwith
    -0.66
     برانيه
    -0.66
    Griffin
    -0.65
    POSITIVE LOGITS
     very
    1.76
    Very
    1.60
     Very
    1.60
    very
    1.57
    VERY
    1.50
     VERY
    1.49
     très
    1.19
     sehr
    1.19
     muy
    1.18
    sehr
    1.18
    Act Density 0.069%

    No Known Activations