INDEX
Explanations
words related to concepts of directionality and causality
concepts related to various types of "ality," such as morality, legality, and functionality
New Auto-Interp
Negative Logits
sonian
-0.89
kers
-0.83
edes
-0.79
fman
-0.79
king
-0.75
iors
-0.73
ribut
-0.72
arte
-0.71
href
-0.70
eric
-0.70
POSITIVE LOGITS
istically
0.71
contag
0.69
Cloak
0.68
Rica
0.65
endi
0.61
Scotia
0.60
butt
0.60
pact
0.58
lapse
0.58
hound
0.58
Activations Density 0.040%