INDEX
Explanations
mentions of locations or places
instances of the word "Scots."
New Auto-Interp
Negative Logits
vernment
-0.76
slic
-0.75
suspic
-0.71
mileage
-0.70
Reviewer
-0.68
livest
-0.68
laundry
-0.66
JPM
-0.66
ĪĴ
-0.66
Downloadha
-0.66
POSITIVE LOGITS
hirt
1.24
weet
1.20
leeve
1.12
hower
1.09
arnaev
1.05
heet
1.05
onic
1.03
ember
0.98
ween
0.98
rue
0.97
Activations Density 0.033%