    Explanations

    phrases or statements emphasizing the main idea or point in a discussion

    Negative Logits
    Token                Logit
    "loh"                -0.15
    "eway"               -0.15
    "ration"             -0.15
    "TargetException"    -0.15
    "opis"               -0.14
    "ippers"             -0.14
    "опис"               -0.14
    "jay"                -0.14
    "aggio"              -0.14
    "amage"              -0.13
    Positive Logits
    Token                Logit
    " point"             0.33
    "point"              0.29
    "-point"             0.27
    " points"            0.26
    ".point"             0.25
    "points"             0.24
    " Point"             0.24
    " POINT"             0.23
    "(point"             0.23
    "(Point"             0.22
    Activation Density: 0.030%

    No Known Activations