INDEX
    Explanations

    keywords in "where" clauses

    Negative Logits
     всем  0.48
     многих  0.48
     segala  0.47
     strives  0.41
     wszel  0.41
     профилакти  0.40
     всей  0.40
     shabby  0.40
    જર  0.39
     男女  0.39

    Positive Logits
     degree  0.50
     실제로  0.49
    至少  0.46
     entweder  0.45
     증가  0.44
     वास्तव  0.44
     increase  0.43
     almeno  0.42
     gerçekten  0.42
    実際に  0.42
    Activation Density: 0.008%

    No Known Activations