    Negative Logits
    Token            Logit
    "tedt"           -0.64
    " erz"           -0.55
    "kannt"          -0.50
    " gé"            -0.46
    " l"             -0.46
    " surf"          -0.45
    " Beach"         -0.44
    " gelombang"     -0.44
    " côn"           -0.44
    "了出去"         -0.44
    Positive Logits
    Token                Logit
    " myſelf"            1.08
    " itſelf"            1.07
    "IsContent"          1.02
    " houſe"             0.87
    " purpoſe"           0.84
    "ISupport"           0.82
    " useDispatch"       0.81
    "SafeMath"           0.81
    "InjectAttribute"    0.81
    "ſelf"               0.81
    Activation Density: 0.021%

    No Known Activations