INDEX
Explanations
the word "Santa," possibly in the context of a name or title
proper nouns related to locations, specifically those starting with "Santa"
mentions of "Santa."
New Auto-Interp
Negative Logits
edly
-0.73
ochond
-0.69
ives
-0.67
td
-0.65
points
-0.65
staking
-0.64
STD
-0.64
hower
-0.63
widget
-0.63
draw
-0.62
POSITIVE LOGITS
Claus
1.52
Clara
1.07
Santa
1.07
Ana
0.94
Monica
0.93
Santa
0.91
Barbara
0.87
Rosa
0.84
Cruz
0.83
Maria
0.82
Activations Density 0.008%