    Explanations

    phrases related to competition or rivalries

    Negative Logits
    Token           Logit
    "<bos>"         -1.61
    "//---"         -0.61
    "<?"            -0.61
    (blank)         -0.53
    "kulum"         -0.52
    (whitespace)    -0.52
    "/*!\n"         -0.52
    "/***\n"        -0.51
    "pessoas"       -0.51
    "//..."         -0.50
    Positive Logits
    Token           Logit
    "Rival"         1.23
    " Rival"        1.19
    " rival"        1.18
    " rivals"       1.12
    "rival"         1.11
    " ecru"         1.10
    " Rivals"       0.92
    " stockholm"    0.91
    " madonna"      0.89
    " 🤣🤣"          0.88
    Activation Density: 0.445%

    No Known Activations