INDEX
Explanations
occurrences of the name "Philip" at a strong activation level
the name "Philip," indicating a focus on this individual's mentions throughout the text
New Auto-Interp
Negative Logits
yrinth
-0.78
eer
-0.77
DERR
-0.74
REAM
-0.70
LOAD
-0.69
inventoryQuantity
-0.68
bnb
-0.68
Ranked
-0.66
ear
-0.66
ipeg
-0.65
POSITIVE LOGITS
Randolph
0.90
anthrop
0.84
Morris
0.83
osate
0.74
son
0.74
Seymour
0.73
istine
0.71
Philip
0.70
ophe
0.70
lectic
0.69
Activations Density 0.005%