INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
    urses
    -0.69
     charism
    -0.67
     Osaka
    -0.65
     humid
    -0.63
     jun
    -0.63
     Lauder
    -0.62
     fortunes
    -0.62
     stash
    -0.61
     ori
    -0.61
     stray
    -0.59
    POSITIVE LOGITS
    adium
    0.83
    tones
    0.78
    bucks
    0.70
    Artist
    0.70
    die
    0.69
    bard
    0.67
    Virgin
    0.66
     Werewolf
    0.65
    200000
    0.62
    peak
    0.61
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.