    Explanations

    comparative phrases that highlight differences or similarities between subjects

    Negative Logits
    (tokens are quoted so that leading spaces remain visible)

    Token                Logit
    "tagHelperRunner"    -0.64
    " HasFactory"        -0.47
    "Hauptartikel"       -0.43
    " ویکی‌پدی"           -0.41
    "новниш"             -0.40
    " cont"              -0.38
    " मन"                -0.35
    " rese"              -0.35
    " conf"              -0.34
    " loop"              -0.34

    Positive Logits
    (tokens are quoted so that leading spaces remain visible)

    Token                Logit
    " Compared"          1.15
    "Compared"           1.13
    "compared"           0.92
    " compared"          0.91
    "比起"               0.86
    "相比"               0.84
    " Comparing"         0.82
    " сравнению"         0.77
    "Comparing"          0.77
    " comparación"       0.71

    Activation Density: 0.039%

    No Known Activations