INDEX
    Explanations

    instances where something is being modified or adjusted in some way

    phrases related to changes or adjustments made to things

    New Auto-Interp
    Negative Logits
    ãĥīãĥ©
    -0.77
    vice
    -0.69
    urus
    -0.65
    20439
    -0.61
    ãĥĪ
    -0.61
     haunt
    -0.59
    inar
    -0.59
    ublic
    -0.58
    Champ
    -0.58
    netflix
    -0.57
    POSITIVE LOGITS
     considerably
    1.25
     slightly
    1.10
     greatly
    1.02
     further
    1.01
     accordingly
    1.01
     substantially
    0.99
     drastically
    0.96
     sufficiently
    0.95
     significantly
    0.94
     dramatically
    0.92
    Act Density 0.345%

    No Known Activations