INDEX
    Explanations

    references to the possibility of different events or actions taking place

    expressions of potential or hypothetical scenarios

    New Auto-Interp
    Negative Logits
    ogie
    -0.84
    gar
    -0.79
    ulu
    -0.76
    eye
    -0.76
    artney
    -0.75
    rix
    -0.74
    ging
    -0.74
    ilver
    -0.71
    waters
    -0.71
    gars
    -0.71
    POSITIVE LOGITS
    ossibility
    0.99
     possibility
    0.84
     confir
    0.84
     horizon
    0.76
    xual
    0.76
     unnecess
    0.76
     Rouhani
    0.75
     hypot
    0.74
     pron
    0.74
     00000000
    0.73
    Act Density 0.015%

    No Known Activations