INDEX
Explanations
prepositions like "in" that indicate location or position
New Auto-Interp
Negative Logits
Donation
-0.75
SHARES
-0.68
empath
-0.67
ange
-0.65
disgusted
-0.63
onica
-0.58
toughness
-0.57
dishonest
-0.56
happiest
-0.55
aturday
-0.55
POSITIVE LOGITS
drawn
0.96
accessible
0.90
ked
0.87
fruition
0.86
moth
0.85
activated
0.84
escap
0.84
circulation
0.82
urated
0.82
jected
0.82
Activations Density 0.177%