INDEX
Explanations
phrases related to possession or control
terms associated with concepts of control and governance
Negative Logits
redacted      -0.62
candid        -0.62
vasive        -0.59
livious       -0.58
admitting     -0.57
Invalid       -0.56
Rhod          -0.54
ipolar        -0.53
accompanied   -0.53
Missing       -0.52
Positive Logits
asses         0.84
vre           0.84
aces          0.78
adle          0.74
insula        0.72
irements      0.72
igree         0.71
rils          0.70
ills          0.69
elight        0.68
Activations Density: 0.266%