INDEX
Explanations
social media actions, such as following, blocking, or unblocking, and their associated dates
references to social media interactions or updates
New Auto-Interp
Negative Logits
corrid
-0.74
ierrez
-0.67
accompanied
-0.63
pregn
-0.63
istani
-0.61
renheit
-0.60
keley
-0.59
incor
-0.59
appell
-0.58
sterdam
-0.58
POSITIVE LOGITS
Comments
0.84
Posts
0.83
Allows
0.83
Adds
0.83
Latest
0.81
Availability
0.81
Invalid
0.78
Likes
0.77
Tweet
0.76
05
0.74
Activations Density 0.118%