INDEX
    Explanations

    references to fantasy worlds and literary works, particularly those associated with Tolkien

    New Auto-Interp
    Negative Logits
     XPAR
    -0.15
    Ñģов
    -0.15
     visibility
    -0.14
    worthy
    -0.14
     disfr
    -0.14
    ilar
    -0.14
     Pru
    -0.14
    aub
    -0.14
    rians
    -0.14
    Advisor
    -0.13
    POSITIVE LOGITS
    ác
    0.17
    esta
    0.17
    efa
    0.16
    odule
    0.15
    ARCHAR
    0.14
    rawer
    0.14
    anford
    0.14
    ullan
    0.14
    ingham
    0.14
    igit
    0.14
    Act Density 0.001%

    No Known Activations