INDEX
Explanations
proper nouns, potentially names of individuals or places
names of individuals and entities that appear repeatedly in the text
New Auto-Interp
Negative Logits
ioned
-0.82
istas
-0.77
itude
-0.75
boards
-0.72
ivity
-0.68
arella
-0.68
paw
-0.67
ingly
-0.66
patrick
-0.66
ammy
-0.65
POSITIVE LOGITS
entric
0.96
llers
0.87
ο
0.83
earch
0.79
ι
0.76
trl
0.74
Blasio
0.73
ller
0.72
ploy
0.71
hovah
0.71
Activations Density 0.024%