INDEX
    Explanations

    references to hierarchical relationships and comparisons between concepts

    Negative Logits
    "zos"            -0.17
    "PT"             -0.15
    "lrt"            -0.15
    "dbcTemplate"    -0.15
    "urb"            -0.15
    "hausen"         -0.14
    "╗"              -0.14
    "maal"           -0.13
    "ton"            -0.13
    "'є"             -0.13
    Positive Logits
    " another"    0.36
    "another"     0.31
    " Another"    0.30
    "Another"     0.27
    " others"     0.23
    "另"          0.21
    " otro"       0.19
    " другой"     0.19
    " Others"     0.19
    " otra"       0.18
    Activation Density: 0.043%

    No Known Activations