INDEX
    Explanations

    negative physical states or conditions

    New Auto-Interp
    Negative Logits
     Verd
    -0.15
     Robinson
    -0.15
    uyu
    -0.15
    aksi
    -0.14
    aleza
    -0.14
    udu
    -0.14
    .ShouldBe
    -0.13
     Pornhub
    -0.13
    ühl
    -0.13
     CascadeType
    -0.13
    POSITIVE LOGITS
     like
    0.46
     zoals
    0.35
     Like
    0.34
    å¦Ĥ
    0.34
    Like
    0.30
     wie
    0.30
     như
    0.29
     seperti
    0.29
     å¦Ĥ
    0.28
     como
    0.28
    Act Density 0.308%

    No Known Activations