INDEX
Explanations
Twitter handles
Twitter handles and user mentions in the text
New Auto-Interp
Negative Logits
Association
-0.92
Parish
-0.87
Evaluation
-0.82
Reconstruction
-0.81
Act
-0.80
CSI
-0.80
Penal
-0.79
Transparency
-0.78
Rend
-0.78
Advisory
-0.78
POSITIVE LOGITS
john
1.33
mma
1.33
mad
1.32
ngth
1.32
christ
1.31
phil
1.30
brown
1.30
wild
1.29
podcast
1.29
kid
1.28
Activations Density 0.178%