INDEX
    Explanations

    phrases that indicate comparison and differences between items or conditions

    New Auto-Interp
    Negative Logits
    وط
    -0.54
    vart
    -0.50
    ubu
    -0.45
    unya
    -0.45
    bren
    -0.44
    سم
    -0.44
     Toute
    -0.44
     Shee
    -0.43
    var
    -0.43
    hadiran
    -0.43
    POSITIVE LOGITS
    principalColumn
    0.79
    WebVitals
    0.79
    IsMutable
    0.78
     kasarigan
    0.78
    ьаж
    0.76
    UnsafeEnabled
    0.69
    NameInMap
    0.67
     EconPapers
    0.67
    AnimationsModule
    0.65
    contentLoaded
    0.62
    Act Density 0.640%

    No Known Activations