INDEX
    Explanations

    phrases that express a sense of reduction or decrease

    New Auto-Interp
    Negative Logits
    DockStyle
    -0.77
    skosten
    -0.74
    bingen
    -0.72
     CarPlay
    -0.71
    ‍♀️
    -0.71
     BoxDecoration
    -0.71
     Wib
    -0.70
    harusnya
    -0.70
     Kirkwood
    -0.69
     conviene
    -0.69
    POSITIVE LOGITS
     less
    1.48
     LESS
    1.45
     Less
    1.39
    Less
    1.36
    less
    1.22
     Lessing
    1.13
    LESS
    1.13
     moins
    1.05
     menos
    1.04
     lefs
    0.96
    Act Density 0.101%

    No Known Activations