INDEX
    Explanations

    numeric values and their patterns

    New Auto-Interp
    Negative Logits
    amel
    -0.16
    armor
    -0.15
     Rub
    -0.15
     रà¤ĸन
    -0.14
    озв
    -0.14
    Reply
    -0.14
    uti
    -0.13
    ronic
    -0.13
    anim
    -0.13
    sein
    -0.13
    POSITIVE LOGITS
    ubat
    0.17
    iche
    0.17
    icher
    0.15
    ÏĢÎŃ
    0.15
    à¥Ģय
    0.15
     Sinn
    0.15
    ugas
    0.14
    ÌĨ
    0.14
    _visibility
    0.14
    illis
    0.14
    Act Density 0.134%

    No Known Activations