INDEX
Explanations
expressions of care and the concept of what matters in personal judgments or opinions
New Auto-Interp
Negative Logits
*__
-0.64
stini
-0.56
みると
-0.51
sempre
-0.50
אופ
-0.49
oper
-0.48
,:),
-0.48
som
-0.48
densed
-0.48
aculture
-0.48
POSITIVE LOGITS
LookAnd
0.92
whatsoever
0.82
مرئيه
0.78
AssemblyCulture
0.76
whatever
0.74
XtraGrid
0.73
regardless
0.72
хьтан
0.71
برانيه
0.70
ьаж
0.70
Activations Density 0.271%