INDEX
Explanations
nouns and adjectives
specific types of grammatical elements, particularly nouns and adjectives
New Auto-Interp
Negative Logits
loo
-0.80
porting
-0.72
Klux
-0.71
KER
-0.70
inness
-0.70
inventoryQuantity
-0.64
ilated
-0.63
20439
-0.62
Palest
-0.62
quished
-0.62
POSITIVE LOGITS
adjective
0.74
trope
0.74
pmwiki
0.71
adject
0.67
odic
0.67
tropes
0.66
onsense
0.65
witz
0.65
ãģĻ
0.65
ures
0.65
Activations Density 0.016%