INDEX
Explanations
proper nouns or specific names
key entities, particularly in academic or historical contexts
New Auto-Interp
Negative Logits
icient
-0.57
isSpecialOrderable
-0.55
inki
-0.55
kefeller
-0.53
etsy
-0.53
teasp
-0.50
sylv
-0.49
cest
-0.49
catentry
-0.49
ãĥĩãĤ£
-0.48
POSITIVE LOGITS
Jr
0.61
ulhu
0.58
ĪĴ
0.56
Method
0.53
Pattern
0.51
Ass
0.51
ALLY
0.50
lang
0.50
vez
0.48
abbage
0.46
Activations Density 1.362%