INDEX
    Explanations

    phrases that indicate types or categories

    New Auto-Interp
    Negative Logits
     Guilt
    -0.60
     genieten
    -0.60
     Ballet
    -0.59
     remercier
    -0.57
     barnen
    -0.56
     évêque
    -0.55
     lắp
    -0.55
     stazione
    -0.54
     fährt
    -0.54
    たまた
    -0.54
    POSITIVE LOGITS
     Мексичка
    0.94
     kinds
    0.77
    fillType
    0.74
     ProtoMessage
    0.74
     snippetHide
    0.73
     types
    0.72
     scenario
    0.72
    Kinds
    0.70
    "):
    
    0.70
     SUDOC
    0.69
    Act Density 0.130%

    No Known Activations