INDEX
Explanations
special characters or symbols within text
instances of a specific character or symbol in text
New Auto-Interp
Negative Logits
ŃĶ
-0.74
icult
-0.71
ysis
-0.67
ocument
-0.66
ijn
-0.66
Blu
-0.63
frag
-0.63
uers
-0.63
ici
-0.62
#$#$
-0.62
POSITIVE LOGITS
––
1.04
âĪĴ
0.87
cases
0.83
advertisement
0.82
-+
0.82
–
0.79
issues
0.79
micro
0.78
style
0.77
mediated
0.76
Activations Density 0.019%