INDEX
Explanations
references to the name "Phil" and its variations or related terms
New Auto-Interp
Negative Logits
uala
-0.20
rous
-0.17
rian
-0.16
etine
-0.16
thừa
-0.15
glGet
-0.15
ylene
-0.15
YPRE
-0.15
elaide
-0.14
ivas
-0.14
POSITIVE LOGITS
ipp
0.32
osoph
0.26
ippi
0.25
ippines
0.25
omen
0.23
osopher
0.22
ipe
0.21
adelphia
0.21
thy
0.20
andering
0.20
Activations Density 0.010%