INDEX
    Explanations

    mentions of costumes or cosplay

    the word 'cos' and its variations in different contexts

    Negative Logits
     Hearts   -0.67
     Lans     -0.65
     Bulls    -0.64
     forged   -0.61
     Giles    -0.61
     Dah      -0.61
     Alph     -0.61
     Beaver   -0.61
     rake     -0.61
     Zucker   -0.60
    Positive Logits
    mopolitan   1.65
    metics      1.40
    mop         1.32
    mic         1.31
    mology      1.31
    metic       1.28
    met         1.19
    mosp        1.19
    mos         1.12
    mo          1.10
    Activation Density: 0.027%

    No Known Activations