INDEX
Explanations
mentions of geographical locations or proper names
references to specific individuals and academic institutions
New Auto-Interp
Negative Logits
ball
-0.75
walking
-0.72
footed
-0.70
draw
-0.69
DER
-0.68
Pg
-0.64
advertising
-0.63
walker
-0.62
patrick
-0.61
alling
-0.61
POSITIVE LOGITS
shire
0.80
bard
0.76
ental
0.76
ional
0.73
iban
0.73
é¾įåĸļ士
0.71
clair
0.70
shaw
0.69
hers
0.68
asus
0.67
Activations Density 0.055%