INDEX
Explanations
food-related words or descriptions
occurrences of the article "a" indicating new experiences or items
New Auto-Interp
Negative Logits
AIDS
-0.94
Contents
-0.82
Ebola
-0.79
evidence
-0.78
âĢİ
-0.76
independence
-0.74
allegations
-0.73
anism
-0.71
vind
-0.70
advertising
-0.70
POSITIVE LOGITS
lot
1.39
bunch
1.36
couple
1.27
LOT
1.21
few
1.20
nice
1.15
bit
1.12
handful
1.04
decent
1.03
little
1.00
Activations Density 0.502%