INDEX
    Explanations

    contrastive phrases indicating limitations or exceptions

    New Auto-Interp
    Negative Logits
    .scalablytyped
    -0.17
    PLUS
    -0.16
     zase
    -0.15
    /antlr
    -0.15
    shint
    -0.15
    ozÃŃ
    -0.14
    šť
    -0.14
    porter
    -0.14
    ColumnInfo
    -0.14
    غط
    -0.14
    POSITIVE LOGITS
     none
    0.38
     nowhere
    0.32
     None
    0.28
    none
    0.28
     among
    0.28
    None
    0.24
     NONE
    0.24
    among
    0.24
     Among
    0.24
    Among
    0.23
    Act Density 0.100%

    No Known Activations