INDEX
Explanations
words related to attributes and classifications of substances or entities, particularly in scientific contexts
New Auto-Interp
Negative Logits
Merc
-0.15
elf
-0.15
ows
-0.15
oleon
-0.15
reverse
-0.15
bows
-0.14
iven
-0.14
ess
-0.14
inema
-0.14
çij
-0.14
POSITIVE LOGITS
EMPLARY
0.17
aginator
0.17
nice
0.16
Łèĥ½
0.16
ogui
0.15
uchar
0.14
/co
0.14
atism
0.14
alion
0.14
nob
0.14
Activations Density 0.057%