Explanations
instances of the word "option" and related phrases indicating choices or alternatives
Negative Logits
ilion      -0.17
anes       -0.15
ible       -0.15
ousse      -0.15
ingly      -0.14
akit       -0.14
.bundle    -0.14
urgeon     -0.14
Pé         -0.14
avings     -0.14
Positive Logits
=options   0.17
215        0.16
weise      0.16
ìĤ¬íķŃ     0.15
ãĤ©        0.15
ìĤ¬íķŃ     0.15
owo        0.15
зи         0.15
orton      0.14
ãģĦãģ¤     0.14
Activations Density 0.072%
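
Several tokens in the logit lists above (for example `ìĤ¬íķŃ`, `ãĤ©`, `ãģĦãģ¤`) look like raw byte-level BPE token strings rather than readable text. The sketch below shows one way to turn such strings back into characters, assuming the underlying tokenizer uses the GPT-2-style byte-to-unicode alphabet; the page itself does not say which tokenizer is used, so that is an assumption. The names `bytes_to_unicode`, `UNICODE_TO_BYTE`, and `decode_token` are illustrative helpers, not part of the dashboard.

```python
def bytes_to_unicode():
    """Build the GPT-2-style mapping from each byte value to a printable character.

    Assumption: the tokenizer behind this dashboard uses this alphabet.
    """
    printable = list(range(33, 127)) + list(range(161, 173)) + list(range(174, 256))
    byte_values = printable[:]
    code_points = printable[:]
    n = 0
    for b in range(256):
        if b not in printable:
            # Non-printable bytes are shifted to code points starting at 256.
            byte_values.append(b)
            code_points.append(256 + n)
            n += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: display character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Decode a byte-level token string to text, or return it unchanged.

    Tokens that are already readable (e.g. Cyrillic) or that do not form
    valid UTF-8 on their own are left as-is.
    """
    try:
        raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
        return raw.decode("utf-8")
    except (KeyError, UnicodeDecodeError):
        return token


for tok in ["ìĤ¬íķŃ", "ãĤ©", "ãģĦãģ¤", "=options", "зи"]:
    print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, `ìĤ¬íķŃ` decodes to the Korean `사항`, `ãĤ©` to the katakana `ォ`, and `ãģĦãģ¤` to the Japanese `いつ`. Tokens such as `=options` and `зи` pass through unchanged.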