INDEX
Explanations
the word "thumbs" or related terms like "thumbs up" or "thumbs down"
references to approval or disapproval, particularly through the expression of "thumbs" actions
New Auto-Interp
Negative Logits
lain
-0.86
Lauder
-0.85
ioned
-0.79
nets
-0.74
Cox
-0.70
abella
-0.68
anz
-0.67
inian
-0.67
lé
-0.67
anas
-0.66
POSITIVE LOGITS
++++
0.95
umbs
0.82
âĺħâĺħ
0.81
thumbs
0.80
urger
0.78
Quantity
0.77
xual
0.75
pe
0.74
peed
0.73
veyard
0.70
Activations Density 0.042%