INDEX
    Explanations

    mentions of diving or related actions

    New Auto-Interp
    Negative Logits
     nakalista
    -0.68
     arbeits
    -0.66
     okam
    -0.66
    buro
    -0.66
    +#+#
    -0.66
     Nasir
    -0.64
     defecto
    -0.64
    Brien
    -0.64
    ladin
    -0.63
    لول
    -0.63
    POSITIVE LOGITS
    Som
    0.96
     dive
    0.93
     Bien
    0.88
     Som
    0.87
     diving
    0.85
     Diving
    0.85
    Diving
    0.84
    Bien
    0.83
     som
    0.83
    som
    0.82
    Act Density 0.214%

    No Known Activations