INDEX
    Explanations

    mentions of specific names, likely proper nouns

    names and terms related to specific individuals and styles, particularly in the context of architecture and pop culture

    New Auto-Interp
    Negative Logits
     concess
    -0.79
    icum
    -0.74
     provision
    -0.73
    inent
    -0.71
    stru
    -0.70
    etheus
    -0.70
    ocrin
    -0.69
    sole
    -0.68
     compr
    -0.67
    iden
    -0.67
    POSITIVE LOGITS
    glers
    1.15
    vernment
    0.96
     Pengu
    0.84
    irlfriend
    0.81
    ospels
    0.80
    IVERS
    0.77
    ORGE
    0.77
    ourmet
    0.77
    gets
    0.76
    FFER
    0.75
    Act Density 0.059%

    No Known Activations