Explanations
mentions of the name "Philip" and its variants, including specific titles and context related to individuals named Philip
Negative Logits
ils      -0.15
lify     -0.15
ude      -0.15
ifs      -0.15
fab      -0.14
급       -0.14
anean    -0.14
ابع      -0.13
ilen     -0.13
صف       -0.13
Positive Logits
stal     0.15
xec      0.14
ons      0.14
son      0.14
zig      0.14
fully    0.14
rone     0.14
eness    0.14
sono     0.14
扶       0.14
Activations Density 0.005%