INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    isbn
    -0.06
    ستان
    -0.06
    оги
    -0.06
    went
    -0.06
    街道
    -0.06
    Producto
    -0.06
     Circular
    -0.06
     Dialogue
    -0.06
    Convert
    -0.06
    غان
    -0.06
    POSITIVE LOGITS
     jealous
    0.15
     jealousy
    0.12
     singly
    0.07
    LEV
    0.07
     devoted
    0.06
     Hydraulic
    0.06
    alous
    0.06
     november
    0.06
     Neptune
    0.06
     caract
    0.06
    Act Density 0.004%

    No Known Activations