INDEX
Explanations
names and locations, particularly mentioning an individual or place names in a statement
proper nouns, particularly names of people and places
New Auto-Interp
Negative Logits
acted
-0.69
stakes
-0.65
growth
-0.64
lest
-0.64
earth
-0.64
DERR
-0.63
suspense
-0.60
acts
-0.60
plur
-0.59
Pg
-0.57
POSITIVE LOGITS
hof
0.80
wagen
0.80
heid
0.75
abee
0.71
imore
0.70
erton
0.70
ascus
0.69
illac
0.68
ayn
0.68
ulhu
0.66
Activations Density 0.258%