INDEX
    Explanations

    references to different classes or categorizations in a variety of contexts

    "class" or "classes"

    New Auto-Interp
    Negative Logits
     bandeira
    -0.56
     noDo
    -0.56
    OGND
    -0.54
    Conteúdo
    -0.53
     iluminação
    -0.52
     voedsel
    -0.49
     חיצוניים
    -0.48
     maquiagem
    -0.47
    mentação
    -0.47
     galinha
    -0.47
    POSITIVE LOGITS
     Class
    1.07
     CLASS
    0.94
    Class
    0.92
    CLASS
    0.84
    class
    0.83
     class
    0.82
     classe
    0.81
     Classe
    0.81
     Classes
    0.76
    classe
    0.74
    Act Density 0.156%

    No Known Activations