INDEX
Explanations
thumbs-related phrases like "thumbs up" and "thumbs down"
references to gestures of approval or disapproval, specifically "thumbs up" and "thumbs down."
New Auto-Interp
Negative Logits
Invasion
-0.70
Tale
-0.69
£ı
-0.68
enario
-0.68
lain
-0.65
anny
-0.65
arian
-0.63
sheltered
-0.63
tracing
-0.63
olate
-0.63
POSITIVE LOGITS
thumbs
3.92
umbs
1.61
cheers
1.36
majorities
0.94
disabilities
0.92
enthusiastic
0.89
smiles
0.89
votes
0.86
nods
0.85
congr
0.83
Activations Density 0.034%