INDEX
    Explanations

    mentions of "system" and related concepts indicating systems or structures in various contexts

    New Auto-Interp
    Negative Logits
    ras
    -0.16
    cente
    -0.15
    ila
    -0.15
     Gem
    -0.15
    engu
    -0.15
     Freund
    -0.14
    lite
    -0.14
    ting
    -0.14
    bbie
    -0.14
    adelphia
    -0.14
    POSITIVE LOGITS
    atic
    0.14
    Tro
    0.14
     punches
    0.14
    isches
    0.14
     veter
    0.13
    Ñĩа
    0.13
    gi
    0.13
    ÏĦÏĤ
    0.13
    -wide
    0.13
    ÑĢÑĥÑĩ
    0.13
    Act Density 0.021%

    No Known Activations