INDEX
    Explanations

    references to file names and related attributes in documents

    New Auto-Interp
    Negative Logits
    a
    -0.61
    Protein
    -0.56
     parlé
    -0.52
     pubblici
    -0.52
     Protein
    -0.52
     protein
    -0.51
    protein
    -0.51
     Ambrose
    -0.48
     природе
    -0.45
    o
    -0.45
    POSITIVE LOGITS
     filename
    1.74
    filename
    1.55
    Filename
    1.14
     Filename
    1.07
     filenames
    1.04
    matchCondition
    1.00
    FILENAME
    0.99
     Chwiliwch
    0.92
    filenames
    0.88
     ModelRenderer
    0.88
    Act Density 0.037%

    No Known Activations