INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    -1.06
     भी
    -1.05
     не
    -1.03
    -1.03
     ко
    -1.03
     الق
    -1.02
    াই
    -1.02
     про
    -1.02
     но
    -1.02
    的话
    -1.02
    POSITIVE LOGITS
    <bos>
    12.87
     fuf
    4.27
     encomp
    4.25
     fta
    4.25
     guarante
    4.24
     effe
    4.24
     squa
    4.21
     affor
    4.18
     secon
    4.14
     desir
    4.12
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.