INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    pl
    -0.66
    serv
    -0.63
    Aim
    -0.60
    ForgeModLoader
    -0.60
    hazard
    -0.59
    cart
    -0.58
    pour
    -0.58
    eas
    -0.57
     chic
    -0.57
    Rated
    -0.57
    POSITIVE LOGITS
     recently
    0.76
     proven
    0.75
     lately
    0.73
    soever
    0.71
    terday
    0.69
    ndra
    0.69
     awhile
    0.68
    rely
    0.68
     now
    0.65
    ilage
    0.65
    Act Density 0.027%

    No Known Activations