INDEX
    Explanations

    adverbs that modify actions or qualities in a descriptive manner

    New Auto-Interp
    Negative Logits
    k
    -0.60
    B
    -0.59
    b
    -0.58
    al
    -0.56
    an
    -0.56
    z
    -0.56
    вица
    -0.56
     vicente
    -0.55
    uncios
    -0.53
    в
    -0.52
    POSITIVE LOGITS
    AddTagHelper
    1.00
    BibitemShut
    0.86
    #
    0.86
    ']")
    0.85
    "],
    
    0.84
    ])]
    0.83
    SequentialGroup
    0.81
    sively
    0.81
    }%
    
    0.80
    "]),
    0.80
    Act Density 0.584%

    No Known Activations