INDEX
    Explanations

    mentions of personal preferences or tastes

    phrases related to personal tastes and preferences

    New Auto-Interp
    Negative Logits
     Chern
    -0.81
    BILITY
    -0.80
    boxing
    -0.79
    ICA
    -0.78
    Assembly
    -0.75
     Brotherhood
    -0.71
     Ded
    -0.69
    ANCE
    -0.69
    Veter
    -0.68
    à¤
    -0.68
    POSITIVE LOGITS
    omething
    1.24
    avorite
    1.17
    ettings
    1.14
    ometimes
    1.10
    cape
    0.99
    hops
    0.98
    uggest
    0.97
     tastes
    0.94
    ystem
    0.93
    wana
    0.93
    Act Density 0.019%

    No Known Activations