Explanations
words that are closely related or interchangeable with each other, often used as synonyms
terms related to concepts of being synonymous or equivalent
Negative Logits
annis         -0.69
izons         -0.63
vae           -0.63
Psychology    -0.63
restraining   -0.61
ihad          -0.61
doors         -0.59
Dalai         -0.59
UGE           -0.59
uld           -0.58
Positive Logits
synonymous    1.41
ophon         0.80
iso           0.78
ansk          0.77
symbol        0.74
opolis        0.74
Nadu          0.73
akis          0.72
additive      0.71
onymous       0.71
Activations Density: 0.013%