INDEX
    Explanations

    sentences with a combination of specific verbs and pronouns

    New Auto-Interp
    Negative Logits
    <bos>
    -2.47
    .
    -0.66
    ,
    -0.60
     кло
    -0.54
    ?
    -0.53
     (
    -0.53
     launched
    -0.52
    :
    -0.52
     established
    -0.52
     revealed
    -0.51
    POSITIVE LOGITS
     maroc
    1.26
     bandung
    1.24
     affez
    1.23
     ananas
    1.18
     cioc
    1.16
     sentra
    1.11
     venuto
    1.09
     ristor
    1.07
     kokos
    1.06
     swarovski
    1.05
    Act Density 6.050%

    No Known Activations