INDEX
Explanations
links or URL patterns
web links or URLs
New Auto-Interp
Negative Logits
Majesty
-0.77
Emin
-0.71
REF
-0.66
Subst
-0.65
aceae
-0.64
¶
-0.64
RET
-0.64
Frankfurt
-0.63
İĭ
-0.61
multipl
-0.61
POSITIVE LOGITS
charism
0.72
usat
0.68
favorites
0.65
hani
0.61
captcha
0.61
govtrack
0.60
iframe
0.57
politician
0.57
WP
0.56
ernandez
0.55
Activations Density 0.114%