INDEX
Explanations
phrases related to sharing information using messaging apps
mentions of the messaging app WhatsApp
New Auto-Interp
Negative Logits
Closure
-0.77
REF
-0.70
lished
-0.63
fitting
-0.62
demolition
-0.62
SPONSORED
-0.61
Viking
-0.61
combustion
-0.60
iling
-0.60
Bellev
-0.60
POSITIVE LOGITS
bour
0.92
hower
0.90
ocial
0.85
ilon
0.84
creen
0.84
acan
0.84
cha
0.82
pace
0.80
atre
0.79
oday
0.79
Activations Density 0.017%