INDEX
    Explanations

    places or settings within a text

    prepositions and phrases indicating location or context

    New Auto-Interp
    Negative Logits
     Challenges
    -0.69
     Janeiro
    -0.68
     Rica
    -0.67
     Puzzles
    -0.67
    enegger
    -0.65
     NAD
    -0.63
     Purs
    -0.62
    abwe
    -0.61
     Chero
    -0.61
    éģ
    -0.61
    POSITIVE LOGITS
    cast
    0.96
    usal
    0.95
    erb
    0.84
    vern
    0.80
    ork
    0.79
    harm
    0.79
    alter
    0.76
    usive
    0.75
    âĢij
    0.74
    hist
    0.74
    Act Density 0.242%

    No Known Activations