INDEX
    Explanations

    occurrences of the word "the."

    New Auto-Interp
    Negative Logits
    __":
    
    -0.60
    elemField
    -0.60
    __':
    
    -0.59
     detailsNormal
    -0.58
     nonUne
    -0.58
    IUrlHelper
    -0.57
     myſelf
    -0.56
    exitRule
    -0.54
     autorytatywna
    -0.54
    Portály
    -0.54
    POSITIVE LOGITS
     hyö
    0.36
    st
    0.33
    DISABLE
    0.32
    вайтесь
    0.32
     levens
    0.31
    gin
    0.31
     ello
    0.30
    jat
    0.29
     typique
    0.29
     اعلام
    0.28
    Act Density 0.363%

    No Known Activations