INDEX
Explanations
mentions or references to the name "Phil"
mentions of the name "Phil" in various contexts
New Auto-Interp
Negative Logits
CLASSIFIED
-0.96
tenance
-0.78
DERR
-0.75
erers
-0.74
eer
-0.68
yrinth
-0.68
GGGG
-0.68
erness
-0.65
ãĥ¼ãĥĨãĤ£
-0.65
Elves
-0.64
POSITIVE LOGITS
anthrop
1.44
istine
1.36
andering
1.14
adel
1.11
oton
1.10
onen
1.07
harm
1.05
ips
1.02
onite
1.01
ophe
1.01
Activations Density 0.015%