INDEX
Explanations
words related to transformations, changes, and transitions
entities or concepts related to identity transformation or status changes
New Auto-Interp
Negative Logits
balances
-0.63
arton
-0.62
bene
-0.61
similarity
-0.60
caveats
-0.60
unpublished
-0.59
)\
-0.58
ups
-0.58
ibrary
-0.57
misconception
-0.57
POSITIVE LOGITS
fodder
0.81
scapego
0.78
livion
0.77
sonian
0.75
anew
0.75
fledged
0.73
overnight
0.72
Normal
0.71
unemploy
0.70
resistant
0.67
Activations Density 0.361%