INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     vaping
    -0.10
    .Mapper
    -0.09
     plastik
    -0.09
     Prius
    -0.08
    .tolist
    -0.08
     Kunststoff
    -0.08
     menopause
    -0.08
    消费
    -0.08
     dishwasher
    -0.08
     dosage
    -0.08
    POSITIVE LOGITS
     castle
    0.17
     castles
    0.16
     fortress
    0.16
    Castle
    0.15
     Castle
    0.14
    castle
    0.13
     Fortress
    0.13
     रक्षा
    0.13
     defensive
    0.13
    0.12
    Act Density 0.044%

    No Known Activations