INDEX
Explanations
names of individuals involved in legal or criminal situations
mentions of individuals' names and related legal situations
New Auto-Interp
Negative Logits
ĵĺ
-0.69
advertisement
-0.68
APR
-0.66
captcha
-0.65
Nevada
-0.64
ZI
-0.62
GW
-0.58
ãĥĩãĤ£
-0.57
tains
-0.57
..."
-0.57
POSITIVE LOGITS
herself
0.74
âķIJ
0.72
itself
0.71
arently
0.70
himself
0.69
ivari
0.68
Himself
0.67
alone
0.66
olor
0.65
's
0.64
Activations Density 0.758%