INDEX
Explanations
instances of words related to people
mentions of "people."
Negative Logits
Accessory          -0.82
inventoryQuantity  -0.77
tnc                -0.77
NES                -0.75
Emer               -0.70
srfAttach          -0.69
Stre               -0.69
COMPLE             -0.69
SPONSORED          -0.68
Lank               -0.68
Positive Logits
smugglers          1.01
who                0.91
folk               0.86
opausal            0.74
iuses              0.73
hood               0.72
bara               0.71
gling              0.71
misunderstood      0.70
else               0.70
Activations Density 0.117%