INDEX
    Explanations

    patterns related to social dynamics and interactions with a focus on emotional responses

    New Auto-Interp
    Negative Logits
     صوتيه
    -0.65
    ftagPool
    -0.50
    GEBURTS
    -0.47
     Bell
    -0.46
    ьаж
    -0.44
    mulos
    -0.43
    BrowserModule
    -0.43
    ParallelGroup
    -0.42
    ashita
    -0.42
     Belle
    -0.42
    POSITIVE LOGITS
     Co
    2.66
    Co
    2.54
     co
    2.37
    co
    2.35
     CO
    2.31
    CO
    2.19
     Ko
    1.89
    Ko
    1.81
     ko
    1.79
     Coh
    1.74
    Act Density 2.683%

    No Known Activations