INDEX
    Explanations

    Mentions of the name "Philip" and its variants, including titles and context associated with individuals named Philip.

    Negative Logits
    ils
    -0.15
    lify
    -0.15
    ude
    -0.15
     ifs
    -0.15
     fab
    -0.14
    급
    -0.14
    anean
    -0.14
    ابع
    -0.13
    ilen
    -0.13
    صف
    -0.13
    Positive Logits
    stal
    0.15
    xec
    0.14
    ons
    0.14
    son
    0.14
    zig
    0.14
    fully
    0.14
    rone
    0.14
    eness
    0.14
    sono
    0.14
    扶
    0.14
    Act Density 0.005%

    No Known Activations