INDEX
    Explanations

    occurrences of the word "show" and its variations

    New Auto-Interp
    Negative Logits
    :✨
    -0.36
     plegable
    -0.34
     kẻ
    -0.30
     フロ
    -0.30
     beloved
    -0.30
     EconPapers
    -0.29
     dintre
    -0.29
     зовут
    -0.29
    principalTable
    -0.28
     politics
    -0.28
    POSITIVE LOGITS
     showing
    2.11
    showing
    2.02
     Showing
    1.98
    Showing
    1.85
     SHOWING
    1.72
    showed
    1.57
     showed
    1.55
     mostrando
    1.47
     montrant
    1.44
     shown
    1.35
    Act Density 0.218%

    No Known Activations