Explanations
positive descriptions of wines
Negative Logits
Token     Logit
urette    -0.15
apter     -0.14
ilig      -0.14
ILA       -0.14
Norris    -0.14
voksne    -0.13
apel      -0.13
itre      -0.13
ipa       -0.13
Draft     -0.13
Positive Logits
Token           Logit
iez             0.15
à¸Ľà¸£à¸°à¸ª    0.15
å©Ĩ             0.15
\core           0.15
ennis           0.14
ุà¹Ī            0.14
Dex             0.14
onde            0.14
inston          0.14
anzi            0.14
Activations Density: 0.002%