INDEX
    Explanations

    words followed by definition or context

    New Auto-Interp
    Negative Logits
    0.49
    david
    0.46
    Other
    0.44
     funkcji
    0.44
    t
    0.44
    他の
    0.44
    ilver
    0.44
     from
    0.43
    0.43
     ሌሎች
    0.42
    POSITIVE LOGITS
     adoles
    0.46
     erop
    0.43
    0.42
     preschool
    0.41
     kindergarten
    0.41
     aislamiento
    0.41
    नियर
    0.40
     ignores
    0.40
    0.39
     simplemente
    0.39
    Act Density 0.005%

    No Known Activations