INDEX
Explanations
mentions of past events or incidents
keywords related to significant events, discrimination, healthcare, and legal issues
New Auto-Interp
Negative Logits
ãĥĩãĤ£
-0.65
plet
-0.62
ãĥ¼ãĥ
-0.59
vertisement
-0.58
hemor
-0.58
ãĥĥãĥī
-0.56
eday
-0.55
Siber
-0.54
sit
-0.54
jurisd
-0.53
POSITIVE LOGITS
¶
1.07
Copyright
0.88
=================
0.86
↵Âł
0.82
Belfast
0.81
violates
0.81
.--
0.79
depends
0.78
↵↵
0.76
↵
0.76
Activations Density 0.980%