INDEX
    Explanations

    religious, moral, or sexual thoughts

    New Auto-Interp
    Negative Logits
     cuál
    1.01
     cuánto
    0.94
    Cuáles
    0.93
     가수
    0.89
     каком
    0.89
    Ջ
    0.88
     Flickr
    0.88
     Podcast
    0.88
     какое
    0.86
    ambio
    0.86
    POSITIVE LOGITS
    Dimensional
    1.01
    night
    0.91
    ilets
    0.88
     unequal
    0.87
    rectangular
    0.86
    msız
    0.86
    ли
    0.85
    bodies
    0.84
    rounded
    0.84
    ണ്ടു
    0.83
    Act Density 0.001%

    No Known Activations