INDEX
    Explanations

    text structured as news articles or formal reports

    Sentences referring to articles or documents

    New Auto-Interp
    Negative Logits
     even
    -0.48
     -
    -0.46
     it
    -0.45
     zelfs
    -0.44
    Even
    -0.44
    .
    -0.44
    ↵↵
    -0.44
     addirittura
    -0.43
    -
    -0.43
    tamanya
    -0.43
    POSITIVE LOGITS
     originally
    0.83
    Hentet
    0.81
     Efq
    0.78
    ricle
    0.73
    originally
    0.72
    IContainer
    0.72
    GEBURTSDATUM
    0.71
    ViewFeatures
    0.70
    Originally
    0.70
     Originally
    0.70
    Act Density 0.300%

    No Known Activations