Explanations
expressions related to showing approval or disapproval
references to "thumbs" and related expressions of approval or disapproval
Negative Logits
Occupations  -0.76
mingham      -0.75
ال           -0.73
hammad       -0.73
iveness      -0.70
places       -0.70
ل            -0.70
ulkan        -0.69
Downloadha   -0.68
urity        -0.67
Positive Logits
thumb   1.09
hole    0.83
        0.81
eenth   0.79
sticks  0.79
wheel   0.79
pad     0.79
tie     0.78
tip     0.77
finger  0.77
Activations Density 0.018%