INDEX
Explanations
the word "of" and related phrases indicating possession or association
New Auto-Interp
Negative Logits
reys
-0.80
abases
-0.75
apons
-0.74
pta
-0.72
ctors
-0.68
notations
-0.68
Regions
-0.68
pak
-0.67
earances
-0.66
tsky
-0.66
POSITIVE LOGITS
inverse
0.74
gamble
0.71
annoyance
0.69
inconvenience
0.68
uphill
0.65
fun
0.65
hassle
0.64
necessity
0.64
luck
0.64
bother
0.63
Activations Density 0.031%