INDEX
    Explanations

    phrases indicating quality and suitability, particularly in product descriptions

    New Auto-Interp
    Negative Logits
    ↵↵
    -0.77
      
    -0.77
    <eos>
    -0.71
     […]
    -0.71
    -0.69
    хьтан
    -0.65
        
    -0.61
    Though
    -0.59
    ...
    -0.59
    )]=
    -0.59
    POSITIVE LOGITS
     تانيه
    0.78
    rungsseite
    0.76
    uxxxx
    0.74
     useStyles
    0.73
    NUMX
    0.71
    Datuak
    0.70
     useAuth
    0.70
    ](#
    0.70
     Савезне
    0.69
     ujednoznacz
    0.69
    Act Density 0.001%

    No Known Activations