INDEX
Explanations
phrases indicating choices or alternatives
the word "or" used in various contexts
New Auto-Interp
Negative Logits
ernels
-0.95
lees
-0.91
flows
-0.81
doms
-0.80
Attacks
-0.78
Accounts
-0.77
kins
-0.76
ouses
-0.76
Rs
-0.76
akes
-0.75
POSITIVE LOGITS
bracelet
0.94
charger
0.91
pamphlet
0.88
a
0.86
chard
0.85
two
0.84
necklace
0.84
thinker
0.84
proposition
0.84
piece
0.84
Activations Density 0.185%