INDEX
    NEGATIVE LOGITS
    "ellipsis"          -0.08
    "budget"            -0.07
    " supermercados"    -0.07
    "perl"              -0.07
    "tsx"               -0.07
    "dust"              -0.07
    "oji"               -0.07
    " Bin"              -0.07
    "valo"              -0.07
    "caption"           -0.07
    POSITIVE LOGITS
    " teachings"        0.16
    " mentor"           0.15
    " mentorship"       0.15
    "导师"              0.15
    " discíp"           0.15
    " disciples"        0.15
    " lineage"          0.15
    " mentors"          0.14
    " apprenticeship"   0.14
    " disciple"         0.13
    Activation Density: 0.055%

    No Known Activations