INDEX
    Explanations

    references to benefits and advantages in various contexts

    New Auto-Interp
    Negative Logits
    erin
    -0.18
    urch
    -0.17
    vä
    -0.16
    :animated
    -0.16
    ìĭ
    -0.15
    ODULE
    -0.15
    vor
    -0.15
     赤
    -0.14
    гл
    -0.14
    ivan
    -0.14
    POSITIVE LOGITS
     benefits
    0.39
     Benefits
    0.35
     advantages
    0.33
    Benefits
    0.31
    antages
    0.28
     disadvantages
    0.25
     benefit
    0.23
    benef
    0.23
     advantage
    0.22
     avantaj
    0.21
    Act Density 0.073%

    No Known Activations