INDEX
    Explanations

    occurrences of the word "you" and its variations

    Negative Logits
    Token             Logit
    "hil"             -0.48
    "###"             -0.45
    " szám"           -0.45
    "Autoritní"       -0.44
    " Stu"            -0.44
    " Islam"          -0.42
    " orientação"     -0.42
    " olur"           -0.42
    " /*#__"          -0.42
    " таки"           -0.42
    Positive Logits
    Token             Logit
    "ſelves"          0.75
    "tanleria"        0.74
    "ConstraintMaker" 0.70
    "UnusedPrivate"   0.69
    "deelte"          0.65
    "nasium"          0.64
    " оригіналу"      0.62
    "FieldBuilder"    0.62
    "ſelf"            0.61
    "SpringBootTest"  0.61
    Activation Density: 0.473%

    No Known Activations