INDEX
    Explanations

    proper nouns, possibly related to controversies or conflicts

    mentions of specific names and terms related to people or entities in context

    Negative Logits
    "esthetic"    -0.98
    "ily"         -0.83
    "esthesia"    -0.82
    "meric"       -0.80
    "eco"         -0.75
    "ijk"         -0.73
    "iland"       -0.73
    "erm"         -0.72
    "ilic"        -0.72
    "java"        -0.72
    Positive Logits
    "aneous"      0.96
    " Cumber"     0.84
    " Lauder"     0.81
    "hyde"        0.80
    "aire"        0.77
    "batch"       0.76
    "ative"       0.75
    "ror"         0.70
    "FORE"        0.70
    "aneously"    0.69
    Activation Density: 0.190%

    No Known Activations