INDEX
    Explanations

    phrases emphasizing the concept of "most" or extremes in comparison

    New Auto-Interp
    Negative Logits
     Italijanski
    -0.63
     للاسماء
    -0.62
     ویکی‌پدی
    -0.62
    IsMutable
    -0.59
     wireType
    -0.58
    contentLoaded
    -0.57
    нгред
    -0.57
    ьаж
    -0.56
    ViewInit
    -0.55
    mpagne
    -0.54
    POSITIVE LOGITS
     most
    0.47
     Most
    0.41
     najbardziej
    0.40
    ftagPool
    0.40
    Most
    0.39
     today
    0.38
     terbesar
    0.38
    intégration
    0.37
     olsun
    0.37
    ที่สุด
    0.35
    Act Density 0.008%

    No Known Activations