INDEX
Explanations
terms related to communications technology and social media platforms
references to digital communication and social media
Negative Logits
Ĭ±
-0.60
referen
-0.59
»Ĵ
-0.55
"{
-0.54
anwhile
-0.54
senal
-0.54
afort
-0.53
ļé
-0.52
romeda
-0.52
arij
-0.51
Positive Logits
.–
0.59
.</
0.58
.'
0.55
ÂŃ
0.52
.''
0.52
or
0.52
,''
0.52
lip
0.51
.
0.51
'.
0.50
Activations Density 1.021%