INDEX
    Explanations
    No Explanations Found
    New Auto-Interp
    Negative Logits
     we
    -1.31
     so
    -1.25
     could
    -1.24
     was
    -1.24
     can
    -1.24
     is
    -1.24
     they
    -1.23
     are
    -1.22
     it
    -1.21
     in
    -1.20
    POSITIVE LOGITS
    <bos>
    8.87
     ftu
    2.36
     autunno
    2.29
     ftre
    2.25
     appunt
    2.21
     sappi
    2.21
     fta
    2.20
     fatis
    2.18
     poft
    2.13
     affez
    2.12
    Act Density 0.000%

    No Known Activations

    This feature has no known activations.