    Explanations

    phrases indicating the conclusion or termination of a discussion

    Negative Logits
    Token                 Logit
    " Roskov"             -0.74
    "UserScript"          -0.72
    "Bibliographie"       -0.71
    " DataTypes"          -0.70
    " ModelExpression"    -0.68
    "WillAppear"          -0.67
    " ContentValues"      -0.67
    " defStyle"           -0.67
    "хьтан"               -0.65
    " useStyles"          -0.65
    Positive Logits
    Token                 Logit
    " End"                0.71
    "orses"               0.69
    " END"                0.69
    "angers"              0.67
    "ANGER"               0.65
    "owing"               0.60
    "othermic"            0.58
    "End"                 0.57
    "urable"              0.57
    " end"                0.57
    Activation Density: 0.177%

    No Known Activations