INDEX
Explanations
social media posts with mentions or shared content
social media interactions and postings
New Auto-Interp
Negative Logits
Leilan
-0.69
cases
-0.68
ppo
-0.67
vernment
-0.63
tabl
-0.57
BG
-0.56
inher
-0.56
kins
-0.55
Inher
-0.54
funer
-0.52
POSITIVE LOGITS
avid
0.66
Wed
0.63
в
0.61
edin
0.61
nesday
0.59
estamp
0.57
Ø
0.57
Tue
0.56
Ùħ
0.56
Tue
0.56
Activations Density 0.056%