INDEX
    Explanations

    the presence of quotes or quotation marks

    Followed by punctuation

    New Auto-Interp
    Negative Logits
     den
    -0.47
    '
    -0.47
     đ
    -0.45
    ,
    -0.44
     o
    -0.44
     ProductService
    -0.43
    weiler
    -0.42
    agian
    -0.42
     <<<<<<<<<<<<<<
    -0.41
     deste
    -0.41
    POSITIVE LOGITS
    ,:);
    0.87
     Efq
    0.86
    ſelves
    0.86
     myſelf
    0.86
    ſelf
    0.85
     Monfieur
    0.83
    Демографія
    0.82
    ,:),
    0.78
     Jefus
    0.77
     pleaſure
    0.76
    Act Density 0.047%

    No Known Activations