INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    <bos>
    -1.87
     gratify
    -0.63
     forbade
    -0.60
     mobilize
    -0.59
     rehabilitate
    -0.57
     mobilized
    -0.56
     ویکی‌آمباردا
    -0.55
     rejoiced
    -0.54
     reinstate
    -0.54
     prioritise
    -0.54
    POSITIVE LOGITS
     Ken
    1.62
     KEN
    1.57
    Ken
    1.54
     ken
    1.37
    KEN
    1.14
    Kenneth
    1.13
     Kenneth
    1.13
     Keny
    1.04
     kenzo
    0.98
    ken
    0.97
    Act Density 0.306%

    No Known Activations