INDEX
Explanations
conjunctions and references to adding or emphasizing qualities within descriptions
New Auto-Interp
Negative Logits
oog
-0.17
Keys
-0.17
ugas
-0.16
keys
-0.14
Soap
-0.14
spoilers
-0.14
aptive
-0.14
м
-0.13
ocular
-0.13
turnstile
-0.13
POSITIVE LOGITS
ead
0.15
SizePolicy
0.15
ilton
0.15
Papa
0.14
ione
0.14
eg
0.14
èŃ
0.13
straightforward
0.13
WT
0.13
eb
0.13
Activations Density 0.026%