INDEX
    Explanations

    phrases related to expressions of opinions or perspectives

    instances of a specific character or symbol

    New Auto-Interp
    Negative Logits
     organisers
    -0.75
     clitor
    -0.74
     unborn
    -0.72
     scissors
    -0.71
     apes
    -0.71
     virginity
    -0.70
     sails
    -0.67
     womb
    -0.66
     bun
    -0.66
     tyres
    -0.66
    POSITIVE LOGITS
    ï¸ı
    1.25
    âĶĢâĶĢ
    1.12
    conom
    0.98
    ----------------------------------------------------------------
    0.94
    ł
    0.93
    âĶĢâĶĢâĶĢâĶĢ
    0.87
    jj
    0.85
    HUD
    0.84
    ï¸
    0.83
    fter
    0.82
    Act Density 0.201%

    No Known Activations