Explanations
phrases that indicate the perception or assessment of situations or people
Negative Logits
Token              Logit
ModelExpression    -0.88
########.          -0.74
déric              -0.73
binant             -0.65
ویکیپدیای          -0.62
estekak            -0.61
utafitiHapana      -0.61
QUENCE             -0.61
iyaki              -0.60
تقاوى              -0.60
Positive Logits
Token              Logit
impression         0.75
mentions           0.70
Mentions           0.66
impressions        0.65
Impression         0.64
impresion          0.64
Stocks             0.62
impression         0.61
StatelessWidget    0.60
AF                 0.60
Activations Density 0.148%