INDEX
Explanations
names of individuals, specifically with the surname "Phillips"
mentions of the name "Phillips."
New Auto-Interp
Negative Logits
ERAL
-0.76
ctors
-0.73
fy
-0.71
gary
-0.71
Interstitial
-0.70
clud
-0.70
flow
-0.70
gha
-0.68
hibition
-0.67
quo
-0.65
POSITIVE LOGITS
Lovecraft
1.00
ippi
0.99
ussen
0.89
atile
0.89
ions
0.79
sey
0.77
ioned
0.76
pie
0.76
ipher
0.73
hift
0.72
Activations Density 0.041%