INDEX
Explanations
the word "closest" along with related terms indicating proximity
New Auto-Interp
Negative Logits
anity
-0.61
inse
-0.60
umen
-0.60
dir
-0.58
andals
-0.58
olor
-0.57
ipel
-0.56
hots
-0.56
oat
-0.56
tails
-0.55
POSITIVE LOGITS
approximation
0.75
kindred
0.66
imaginable
0.64
neighbour
0.63
analogue
0.62
confid
0.61
competitor
0.61
closest
0.60
possible
0.58
allied
0.58
Activations Density 7.810%