INDEX
Explanations
phrases related to personal characteristics and occupations
terms related to social constructs and roles
Negative Logits
sites         -0.77
hops          -0.75
rencies       -0.74
places        -0.73
steps         -0.72
prints        -0.71
iuses         -0.70
rooms         -0.69
rogens        -0.68
drops         -0.66
Positive Logits
inhibitor      0.86
thereof        0.86
thereto        0.81
antidote       0.76
mustache       0.73
charger        0.73
loader         0.72
accompanying   0.71
dose           0.68
indicator      0.68
Activations Density 0.406%