INDEX
    Explanations

    forms of the word "best" and its variations

    New Auto-Interp
    Negative Logits
    arra
    -0.18
     rod
    -0.17
     rag
    -0.17
     Ra
    -0.17
     rat
    -0.16
     RS
    -0.16
    urum
    -0.16
    rat
    -0.16
    atra
    -0.15
     Rom
    -0.15
    POSITIVE LOGITS
    rev
    0.40
    REV
    0.29
    Rev
    0.26
     REV
    0.26
    ÑĢев
    0.24
     Rev
    0.23
     rev
    0.22
    _rev
    0.22
     Riv
    0.22
    .rev
    0.21
    Act Density 0.007%

    No Known Activations