INDEX
    Explanations

    references to collaboration and support among individuals in different contexts

    New Auto-Interp
    Negative Logits
    frey
    -0.17
    fef
    -0.17
    FIG
    -0.14
    quier
    -0.14
    oord
    -0.14
    isoft
    -0.14
    bara
    -0.14
    ItemImage
    -0.14
    ermo
    -0.13
     vyh
    -0.13
    POSITIVE LOGITS
     countless
    0.18
     unnamed
    0.16
    383
    0.15
     wherever
    0.15
    761
    0.15
    959
    0.14
    469
    0.14
    ocked
    0.14
    147
    0.14
     those
    0.14
    Act Density 0.049%

    No Known Activations