INDEX
    Explanations

    quantifiers and modifiers that express degrees or intensity

    New Auto-Interp
    Negative Logits
    SequentialGroup
    -0.69
    Geplaatst
    -0.67
     Infórmanos
    -0.64
     ſche
    -0.63
     beſte
    -0.62
     GenerationType
    -0.59
    roned
    -0.59
    -0.58
    MessageTagHelper
    -0.58
    Хьажоргаш
    -0.58
    POSITIVE LOGITS
     very
    0.47
     extraordinarily
    0.42
     extremely
    0.39
     too
    0.39
     climate
    0.39
    .
    0.38
     exceptionally
    0.38
     stør
    0.35
    zuführen
    0.35
     harsh
    0.33
    Act Density 0.035%

    No Known Activations