INDEX
Explanations
mentions of a location or place
instances of the article "a."
New Auto-Interp
Negative Logits
shares
-0.73
intentions
-0.71
adherents
-0.71
honors
-0.70
agents
-0.69
indicators
-0.68
meanings
-0.68
endors
-0.68
pics
-0.66
affiliates
-0.65
POSITIVE LOGITS
multitude
0.92
bunch
0.92
plethora
0.91
hundred
0.90
whopping
0.89
lengthy
0.89
sizeable
0.89
dozen
0.87
lot
0.87
handful
0.87
Activations Density 0.430%