INDEX
    Explanations

    the presence of deprecated terms or outdated references

    Negative Logits
    arin            -0.15
    edException     -0.15
    ecta            -0.15
     Insight        -0.14
    phia            -0.14
    oldt            -0.14
    ÑĮко            -0.14
    اÙĪØ±ÛĮ          -0.14
     pragma         -0.14
    >manual         -0.14
    Positive Logits
    urv             0.17
    StackNavigator  0.15
     bottom         0.15
    roz             0.15
     heter          0.15
     vill           0.15
    oding           0.14
    Ïĥμ             0.14
    ras             0.14
     Bottom         0.14
    Act Density 0.001%

    No Known Activations