INDEX
    Explanations

    terms that indicate functionality and performance in product reviews

    New Auto-Interp
    Negative Logits
    .createObject
    -0.15
    yle
    -0.15
    elage
    -0.14
    igos
    -0.14
    gil
    -0.14
    nech
    -0.14
    anson
    -0.14
     prostÅĻed
    -0.14
    ód
    -0.13
    âķĿ
    -0.13
    POSITIVE LOGITS
     out
    0.65
     straight
    0.44
    åĩº
    0.42
     Out
    0.41
    out
    0.39
    (out
    0.35
     OUT
    0.35
     из
    0.35
    straight
    0.35
     Straight
    0.34
    Act Density 0.027%

    No Known Activations