INDEX
    Explanations

    mentions of specific names, likely related to people

    mentions of specific individuals, particularly those with the name Enrique

    New Auto-Interp
    Negative Logits
    robe
    -0.69
     tigers
    -0.68
    rums
    -0.64
    ramid
    -0.64
     skelet
    -0.64
     sled
    -0.63
    reference
    -0.62
    yrinth
    -0.62
    rog
    -0.62
    ulative
    -0.62
    POSITIVE LOGITS
    terday
    1.02
    cius
    0.91
    jamin
    0.88
    ignt
    0.85
    vironment
    0.81
    CRIPTION
    0.79
    Ò
    0.78
     Moines
    0.78
    istence
    0.77
    kt
    0.74
    Act Density 0.012%

    No Known Activations