INDEX
    Explanations

    words related to important structures or supporting elements

    terms related to structural support or foundational concepts

    New Auto-Interp
    Negative Logits
    aps
    -0.91
    Attempts
    -0.79
    vertisements
    -0.79
    avers
    -0.77
     therape
    -0.76
    aped
    -0.76
    imar
    -0.76
    umatic
    -0.75
    ivan
    -0.74
    aver
    -0.73
    POSITIVE LOGITS
     backbone
    1.50
    layer
    0.96
    SourceFile
    0.87
     guts
    0.78
    xual
    0.74
    REDACTED
    0.73
     fibre
    0.72
    Connector
    0.72
    beard
    0.72
     bones
    0.71
    Act Density 0.010%

    No Known Activations