INDEX
    Explanations

    proper nouns and specific names

    Negative Logits
    Token      Logit
    ' Bux'     -1.37
    ' Dux'     -1.31
    ' Ox'      -1.24
    'Ox'       -1.20
    ' Foxx'    -1.18
    'Cx'       -1.16
    ' Dax'     -1.14
    ' Cox'     -1.12
    ' Mox'     -1.11
    ' Bex'     -1.10
    Positive Logits
    Token               Logit
    'IsContent'         0.84
    'PostMapping'       0.77
    'routeProvider'     0.74
    'jkl'               0.74
    '>",\n'             0.71
    'tfrac'             0.71
    'makeConstraints'   0.70
    'Слу'               0.69
    ' Inti'             0.69
    'findpost'          0.68
    Activation Density: 1.668%

    No Known Activations