INDEX
    Explanations

    occurrences of the term "Hon" or its variations, indicating a focus on honors or titles

    New Auto-Interp
    Negative Logits
    füh
    -0.15
    ennial
    -0.14
    ssid
    -0.14
     opak
    -0.14
    incinn
    -0.14
    ago
    -0.13
    ulton
    -0.13
    iffin
    -0.13
    ottage
    -0.13
    ypi
    -0.13
    POSITIVE LOGITS
    ettle
    0.15
    ing
    0.15
    astery
    0.15
    acin
    0.14
    mile
    0.14
     Barb
    0.14
     Nest
    0.14
    nested
    0.14
    emann
    0.14
    ettel
    0.14
    Act Density 0.005%

    No Known Activations