INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    óz
    -0.15
    nder
    -0.15
     specially
    -0.14
     pun
    -0.14
    é½
    -0.14
    Prov
    -0.14
     (*.
    -0.14
     desc
    -0.14
    ês
    -0.13
    oren
    -0.13
    POSITIVE LOGITS
    åģ
    0.15
    ellig
    0.15
    ampa
    0.14
    طر
    0.14
    860
    0.13
    ilogy
    0.13
    756
    0.13
    اÙħبر
    0.13
    usra
    0.12
    uir
    0.12
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.