INDEX
    Explanations

    instances of the word "same" or variations thereof

    same followed by qualifier

    New Auto-Interp
    Negative Logits
    popo
    -0.47
    PDC
    -0.47
     Uro
    -0.46
     hackers
    -0.45
     palanca
    -0.44
     povol
    -0.44
    PPC
    -0.44
    κος
    -0.43
     BPI
    -0.43
    fptr
    -0.43
    POSITIVE LOGITS
     same
    1.33
    Same
    1.32
     Same
    1.23
    same
    1.20
     SAME
    1.08
    SAME
    1.07
    zelfde
    0.92
    isSame
    0.88
     mesmas
    0.85
     samma
    0.81
    Act Density 0.040%

    No Known Activations