INDEX
    Explanations
    Negative Logits
    فاعل        -0.09
     chest      -0.09
     brug       -0.08
     ([]        -0.08
     arriv      -0.08
     εργ        -0.08
    Chest       -0.08
     []↵↵       -0.08
     соб        -0.08
     rails      -0.08
    Positive Logits
     saints     0.08
     Row        0.08
    .Column     0.08
    <Row        0.08
    egl         0.08
    roads       0.07
                0.07
    	Page       0.07
    geordnet    0.07
     teachers   0.07
    Act Density 0.000%

    No Known Activations