INDEX
    Explanations

    phrases indicating possession or individuality

    New Auto-Interp
    Negative Logits
     itself
    -0.43
     Itself
    -0.40
     horen
    -0.39
    itself
    -0.39
    เอง
    -0.37
     herself
    -0.35
     speelt
    -0.34
     spreken
    -0.34
     sám
    -0.33
    hésite
    -0.33
    POSITIVE LOGITS
     initiative
    0.56
     kind
    0.54
    AsUp
    0.52
     pace
    0.52
     opinion
    0.51
     linkovi
    0.51
     &___
    0.50
     private
    0.49
     skin
    0.48
     special
    0.48
    Act Density 0.016%

    No Known Activations