INDEX
    Explanations

    phrases that introduce authors or creators of content

    Negative Logits
    Token                Logit
    "Ārējās"             -0.76
    " Préférences"       -0.68
    " ditemp"            -0.67
    " démocr"            -0.66
    "Datuak"             -0.65
    " normaux"           -0.64
    " suivants"          -0.64
    " placée"            -0.64
    "SequentialGroup"    -0.64
    " overras"           -0.64
    Positive Logits
    Token                Logit
    " about"             0.60
    "permitAll"          0.51
    " kuhusu"            0.51
    " concerning"        0.51
    " ABOUT"             0.50
    " despre"            0.49
    "bout"               0.48
    " apie"              0.47
    " About"             0.47
    " Sobre"             0.47
    Activation Density: 0.077%

    No Known Activations