INDEX
Explanations
mentions of locations and people's names, especially with special characters mixed in
references to legislative actions or significant legal decisions
New Auto-Interp
Negative Logits
exha
-0.57
Ĥİ
-0.55
carbohyd
-0.52
Pastebin
-0.51
referen
-0.51
tremend
-0.51
catentry
-0.50
ccording
-0.50
pse
-0.48
wcs
-0.47
POSITIVE LOGITS
ensis
0.58
imore
0.55
orf
0.54
ragon
0.54
ite
0.52
atics
0.51
u
0.50
bis
0.50
iland
0.49
ierre
0.49
Activations Density 2.243%