INDEX
Explanations
expressions related to claims or assertions within a legal or formal context
New Auto-Interp
Negative Logits
Lobby
-0.17
lobby
-0.16
rm
-0.14
ariat
-0.14
undermin
-0.14
ieber
-0.14
PLICIT
-0.14
º«
-0.13
unist
-0.13
issen
-0.13
POSITIVE LOGITS
cynical
0.16
anger
0.16
ubre
0.15
ायन
0.15
cyn
0.15
negative
0.15
retali
0.15
complaints
0.14
mand
0.14
Mand
0.14
Activations Density 0.015%