INDEX
Explanations
specific mentions of someone making, earns money, or doing a job
actions related to creating or producing something
New Auto-Interp
Negative Logits
thia
-0.72
assis
-0.69
heter
-0.68
hood
-0.68
Mania
-0.68
stration
-0.64
SPONSORED
-0.63
Niet
-0.62
Fram
-0.60
agogue
-0.60
POSITIVE LOGITS
mistakes
1.10
money
1.08
decisions
1.05
sure
1.02
pilgr
1.00
strides
0.99
sacrifices
0.95
documentaries
0.92
noises
0.92
hift
0.91
Activations Density 0.121%