INDEX
Explanations
the word "only" followed by a noun or pronoun, suggesting a unique or exclusive situation
New Auto-Interp
Negative Logits
rers
-0.70
ayne
-0.70
urchase
-0.67
respectively
-0.67
ullah
-0.66
arbon
-0.65
ulk
-0.65
hops
-0.65
wash
-0.64
ensibly
-0.64
POSITIVE LOGITS
thing
0.84
culprit
0.83
beneficiary
0.75
troubled
0.74
frontier
0.73
pmwiki
0.73
contender
0.71
phenomenon
0.70
accol
0.69
aspect
0.69
Activations Density 0.048%