INDEX
Explanations
entities compared as being similar or equivalent to one another
New Auto-Interp
Negative Logits
tein
-0.82
ular
-0.64
Pastebin
-0.63
pat
-0.63
omatic
-0.61
elfth
-0.61
cel
-0.60
=-=-
-0.60
Ging
-0.59
NER
-0.58
POSITIVE LOGITS
alike
1.39
soever
0.86
WHERE
0.80
lihood
0.77
rejoice
0.77
sexes
0.75
greets
0.73
strives
0.71
fascinated
0.70
perished
0.69
Activations Density 0.025%