INDEX
Explanations
references to social media platforms, particularly Instagram and Twitter
New Auto-Interp
Negative Logits
AndEndTag
-0.69
éché
-0.66
__':
-0.63
__":
-0.63
__":
-0.62
vois
-0.61
TTE
-0.60
gobiernos
-0.60
]=$
-0.60
">'.$
-0.59
POSITIVE LOGITS
3.17
2.90
2.75
2.25
2.22
1.96
Insta
1.64
Insta
1.51
insta
1.42
1.38
Activations Density 0.066%