INDEX
    Explanations

    the use of adverbs and phrases that characterize frequency or manner in a context

    Negative Logits
    ôle
    -1.44
     sit
    -1.40
    acker
    -1.36
    elle
    -1.35
    uto
    -1.35
     {
    -1.33
    meet
    -1.31
    ari
    -1.30
    noreply
    -1.28
    ette
    -1.27
    Positive Logits
    2.79
    <|padding|>
    2.79
    2.79
    <|outofrange|>
    2.79
    2.79
    2.79
    č↵                       
    2.79
                             
    2.79
    2.79
                                                                                 
    2.79
    Act Density 1.405%

    No Known Activations