INDEX
    Explanations

    names and ship designations

    New Auto-Interp
    Negative Logits
    -3.94
    -3.44
    -3.31
    -3.25
    ه
    -3.16
    0
    -3.16
    -3.05
    м
    -3.00
    -2.92
    -2.92
    POSITIVE LOGITS
    2.88
    2.77
    2.56
    2.41
    ’,
    2.41
    各种
    2.34
     privadas
    2.34
    2.25
    ’?
    2.25
    ”—
    2.25
    Act Density 0.011%

    No Known Activations