Explanations
selfies, both being taken and posted
references to selfies and related actions
Negative Logits
Token       Logit
endez       -0.84
profits     -0.76
endale      -0.76
ONSORED     -0.75
ends        -0.74
iance       -0.74
Flavoring   -0.73
end         -0.73
iant        -0.73
artment     -0.72
Positive Logits
Token       Logit
selfies     1.07
selfie      0.99
OPLE        0.75
Gallery     0.75
Doodle      0.73
onstage     0.73
tan         0.70
pics        0.69
nude        0.69
outdoors    0.68
Activations Density 0.026%
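
The density figure above can be read as the share of sampled token positions on which the feature fires. The page does not define the statistic, so the sketch below assumes "active" means an activation strictly greater than zero. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the page or from any particular tool.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: a feature counts as active at a position when its
    activation is strictly greater than `threshold` (0.0 by default).
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 3 of which activate the feature.
sample = [0.0] * 9997 + [1.2, 0.4, 3.1]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.026% would correspond to roughly 26 active positions per 100,000 sampled tokens, which fits a feature that responds to a narrow topic such as selfies.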