INDEX
    Explanations

    greetings, phrases, and commands

    New Auto-Interp
    Negative Logits
    linkedin
    0.44
    használ
    0.42
     подход
    0.41
    hão
    0.41
    人士
    0.38
    âtel
    0.38
     दायरे
    0.38
     ಕ್ಷೇತ್ರದಲ್ಲಿ
    0.37
    0.37
    外部
    0.36
    POSITIVE LOGITS
     slogans
    1.11
     greetings
    1.09
     phrases
    1.07
     frases
    1.03
     affirmations
    1.03
     commands
    1.01
     mantras
    1.00
     greeting
    0.97
     praises
    0.92
     명령
    0.91
    Act Density 0.033%

    No Known Activations