INDEX
Explanations
prepositional phrases indicating a specific relationship between entities
references to independence or self-sufficiency
New Auto-Interp
Negative Logits
Relations
-0.61
APD
-0.60
Dialogue
-0.56
aukee
-0.55
ifax
-0.55
knit
-0.54
pod
-0.54
Warfare
-0.53
sol
-0.53
halla
-0.52
POSITIVE LOGITS
cost
1.17
expense
0.99
reason
0.96
price
0.93
cost
0.90
Reason
0.89
whim
0.84
costs
0.84
chance
0.82
merit
0.80
Activations Density 0.455%