INDEX
Explanations
phrases related to possession or ownership
phrases or references that include the word "of."
New Auto-Interp
Negative Logits
uador
-0.73
antine
-0.71
amsung
-0.71
bench
-0.71
ounters
-0.70
ocene
-0.70
ower
-0.69
orously
-0.68
igent
-0.68
orer
-0.67
POSITIVE LOGITS
glory
0.74
theirs
0.68
selves
0.65
course
0.64
hers
0.63
dominance
0.63
Evil
0.61
SL
0.60
counterparts
0.60
ours
0.59
Activations Density 0.183%