INDEX
Explanations
occurrences of the name "Philip" or related variations in the text
Negative Logits
Token     Logit
ertura    -0.16
"":       -0.15
vette     -0.14
daq       -0.14
ional     -0.14
imals     -0.14
ãĤīãģı    -0.14
charg     -0.14
uteur     -0.14
mittel    -0.14
Positive Logits
Token     Logit
pe        0.28
ppe       0.22
ps        0.20
ipp       0.18
pos       0.18
ipe       0.18
ippi      0.17
pon       0.17
pen       0.17
pron      0.16
Activations Density: 0.007%