INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    +:+
    -0.59
    كذا
    -0.51
    ckså
    -0.47
     Extra
    -0.46
    клопе
    -0.46
    InputBorder
    -0.45
    Less
    -0.45
    __':
    
    -0.45
    页面存档备份
    -0.44
    pēc
    -0.44
    POSITIVE LOGITS
     raise
    0.94
     increase
    0.94
     raising
    0.93
     boost
    0.93
     boosting
    0.87
     enhance
    0.85
     Raise
    0.83
     raises
    0.82
     enhancing
    0.82
     Raising
    0.82
    Act Density 0.000%

    No Known Activations