INDEX
    Explanations

    proper names of individuals

    New Auto-Interp
    Negative Logits
    <bos>
    -1.78
    Enllaces
    -0.60
    /**
    -0.60
    -0.57
    Cyfeiriadau
    -0.57
     mobil
    -0.56
    rid
    -0.56
    Життєпис
    -0.55
     Cecil
    -0.54
    <?
    -0.53
    POSITIVE LOGITS
     dave
    1.61
     Dave
    1.51
    Dave
    1.44
    dave
    1.19
     bandeau
    1.11
     beaute
    1.03
     swarovski
    1.02
     blackpink
    1.00
     pettico
    0.98
     vété
    0.97
    Act Density 0.395%

    No Known Activations