INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     IPL
    -0.07
    escape
    -0.07
     Developer
    -0.07
     amino
    -0.07
     olanak
    -0.07
    Hola
    -0.06
    -0.06
     git
    -0.06
    _routing
    -0.06
     flame
    -0.06
    POSITIVE LOGITS
     poor
    0.11
     Poor
    0.09
     impoverished
    0.07
     poorest
    0.07
     poverty
    0.07
    Poor
    0.07
     Poverty
    0.06
     there
    0.06
     Lowell
    0.06
    0.06
    Act Density 0.012%

    No Known Activations