INDEX
    Explanations

    various forms of the word "to" and related linguistic constructs

    the beginning of documents (the <bos> token).

    New Auto-Interp
    Negative Logits
    CloseOperation
    -0.71
    +#+
    -0.71
    uxxxx
    -0.63
     firebaseConfig
    -0.60
    Newswire
    -0.56
    قایناقلار
    -0.56
    DriverManager
    -0.54
     jsPsych
    -0.53
     ſy
    -0.53
     absten
    -0.52
    POSITIVE LOGITS
     respectivas
    0.46
     consideración
    0.46
    TokenNameDOT
    0.42
     porción
    0.41
     bersifat
    0.41
     şeyler
    0.41
     Mitarbeit
    0.40
     činnosti
    0.39
     necesaria
    0.39
     utilización
    0.39
    Act Density 1.988%

    No Known Activations