INDEX
Explanations
phrases indicating expectations or demands from individuals or groups
Negative Logits
Yates         -0.16
.setResult    -0.16
pany          -0.14
meg           -0.14
plit          -0.14
oup           -0.14
rema          -0.14
Ez            -0.14
izard         -0.14
olders        -0.14
Positive Logits
same          0.21
Applied       0.20
applied       0.19
SAME          0.19
similarly     0.18
Applied       0.18
same          0.17
apply         0.17
Same          0.17
Apply         0.16
Activations Density: 0.172%