INDEX
Explanations
phrases indicating the sacrifice of one aspect for another
New Auto-Interp
Negative Logits
sez
-0.18
jadx
-0.16
atti
-0.15
addon
-0.15
UIApplication
-0.15
iam
-0.14
-fr
-0.14
mans
-0.14
_LR
-0.14
ÑģÑĤа
-0.14
POSITIVE LOGITS
expense
0.35
Expense
0.33
sacrifice
0.27
expense
0.26
sacrificing
0.26
sacrificed
0.26
Kosten
0.25
Sacr
0.24
expenses
0.23
Expense
0.23
Activations Density 0.145%