INDEX
    Explanations

    phrases indicating varying levels of quality or size

    New Auto-Interp
    Negative Logits
     gorą
    -0.66
    RegressionTest
    -0.57
     ALWAYS
    -0.57
     Zapo
    -0.54
    Totally
    -0.52
     typeof
    -0.52
    Always
    -0.51
    anyak
    -0.51
    servez
    -0.51
     Always
    -0.50
    POSITIVE LOGITS
     decent
    0.88
     decently
    0.81
    considerable
    0.77
     considerable
    0.76
     fairly
    0.75
     непло
    0.74
     sizable
    0.74
     Decent
    0.70
    Abraço
    0.70
     sizeable
    0.70
    Act Density 0.255%

    No Known Activations