INDEX
    Explanations

    intensifiers that modify adjectives or adverbs, particularly emphasizing degree

    New Auto-Interp
    Negative Logits
    ÑĨÑĥ
    -0.07
    rama
    -0.07
    SizeMode
    -0.06
    _pref
    -0.06
    ville
    -0.06
    zilla
    -0.06
    bart
    -0.06
    ãģĹãģĭ
    -0.06
    rys
    -0.06
    Snap
    -0.06
    POSITIVE LOGITS
     quot
    0.07
    otto
    0.07
    edly
    0.07
    chal
    0.06
    ioc
    0.06
     Abrams
    0.06
    .vs
    0.06
     мала
    0.06
    lect
    0.06
     sen
    0.06
    Act Density 0.006%

    No Known Activations