INDEX
Explanations
expressions of preference and superlatives related to experiences, opportunities, or items
New Auto-Interp
Negative Logits
inen
-0.16
inem
-0.14
olars
-0.14
ãģŃ
-0.14
ourselves
-0.14
bem
-0.14
zza
-0.14
913
-0.13
modal
-0.13
òa
-0.13
POSITIVE LOGITS
thing
0.36
Thing
0.25
thing
0.25
Thing
0.23
coisa
0.21
things
0.20
EVER
0.20
ever
0.18
anyone
0.18
cosa
0.18
Activations Density 0.087%