INDEX
Explanations
words related to satisfaction and approval
words related to satisfaction and acceptable evaluations
New Auto-Interp
Negative Logits
umn
-0.69
olan
-0.68
artifacts
-0.68
pmwiki
-0.66
Torn
-0.65
fi
-0.65
enic
-0.64
planes
-0.64
plane
-0.63
shortened
-0.63
POSITIVE LOGITS
Satisf
1.16
satisfied
1.14
satisfaction
1.07
atisf
1.03
satisfy
0.98
actory
0.91
satisfies
0.91
satisf
0.87
ysis
0.87
MENTS
0.87
Activations Density 0.017%