INDEX
Explanations
words related to the ability to differentiate or distinguish between things
terms related to distinguishing or differentiating between categories or objects
New Auto-Interp
Negative Logits
oil
-0.69
nz
-0.65
roller
-0.62
odd
-0.62
Omni
-0.60
rollers
-0.59
MN
-0.59
eden
-0.59
wives
-0.58
md
-0.57
POSITIVE LOGITS
ively
1.08
iates
0.95
ĨĴ
0.94
iveness
0.93
warr
0.91
distinctions
0.86
iating
0.86
distinguishing
0.85
ĸļ
0.84
distinguish
0.84
Activations Density 0.023%