INDEX
Explanations
social media likes and comments
New Auto-Interp
Negative Logits
Glob
0.42
Unknown
0.41
无比
0.38
])
0.37
সমস্যাবলী
0.37
Text
0.37
tmpobj
0.37
Movie
0.36
UNK
0.36
末
0.36
POSITIVE LOGITS
likes
0.70
Likes
0.68
टिप्प
0.66
likes
0.66
comments
0.65
liked
0.64
Likes
0.63
liking
0.60
comment
0.60
comments
0.58
Activations Density 0.000%