INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     busy
    -1.44
    busy
    -1.20
    Busy
    -1.17
     Busy
    -1.09
     sibuk
    -1.03
     tour
    -0.96
     busiest
    -0.88
     toured
    -0.84
    tour
    -0.84
     CreateTagHelper
    -0.82
    POSITIVE LOGITS
     Diſ
    0.82
     purpoſe
    0.73
     Houſe
    0.71
     uſe
    0.67
     Inſ
    0.67
     Theſe
    0.65
     laſt
    0.65
     Reſ
    0.63
     itſelf
    0.63
    baw
    0.62
    Act Density 0.299%

    No Known Activations