INDEX
    Explanations

    phrases related to completeness and thoroughness in descriptions or criteria

    Negative Logits
    Token               Logit
    " الرياضيه"         -0.66
    "WriteTagHelper"    -0.62
    " ligiloj"          -0.61
    "GEBURTSDATUM"      -0.58
    "الحياه"            -0.57
    "InitVars"          -0.56
    "UnitTesting"       -0.56
    "RenderAtEndOf"     -0.56
    " saites"           -0.56
    " فريبيس"           -0.55
    Positive Logits
    Token               Logit
    " entirely"         0.58
    " completamente"    0.55
    " completely"       0.54
    " fully"            0.53
    " полностью"        0.52
    "Fully"             0.51
    "entire"            0.51
    "完全"              0.51
    "Completely"        0.51
    " Completely"       0.49
    Activation Density: 0.930%

    No Known Activations