INDEX
    Explanations

    mentions of positive reviews on Amazon

    occurrences of the word "on."

    New Auto-Interp
    Negative Logits
    auga
    -0.70
    eties
    -0.67
    mere
    -0.66
    EngineDebug
    -0.65
    arnaev
    -0.65
    htaking
    -0.64
    ¯¯
    -0.64
    ptive
    -0.62
    lethal
    -0.62
    amount
    -0.62
    POSITIVE LOGITS
     reddit
    1.45
     facebook
    1.44
     youtube
    1.41
     Youtube
    1.38
     Reddit
    1.35
     twitter
    1.34
     behalf
    1.32
     Github
    1.30
     forums
    1.25
     Facebook
    1.25
    Act Density 0.160%

    No Known Activations