INDEX
    Explanations

    mentions of the letter 'h'

    New Auto-Interp
    Negative Logits
    ãĤ´ãĥ³
    -0.79
    terday
    -0.71
    lihood
    -0.68
     mine
    -0.66
    ãĥ¯
    -0.65
     Beware
    -0.65
    Leaks
    -0.63
     flush
    -0.61
     succeeding
    -0.59
    Ô
    -0.59
    POSITIVE LOGITS
    oused
    1.28
    ulk
    1.19
    anging
    1.17
    anky
    1.16
    ospital
    1.15
    ousing
    1.13
    ollywood
    1.12
    olly
    1.12
    uddled
    1.11
    acking
    1.11
    Act Density 0.015%

    No Known Activations