INDEX
    Explanations

    names and references to people in various contexts

    Negative Logits
    Token             Logit
    "ftagPool"        -0.57
    "+#+#"            -0.54
    " لينك"           -0.54
    "Datuak"          -0.53
    " виправивши"     -0.48
    " kaarangay"      -0.47
    " BorderRadius"   -0.46
    "TestingModule"   -0.46
    "enderror"        -0.45
    "Hentet"          -0.43
    Positive Logits
    Token             Logit
    " said"           2.03
    " says"           1.88
    " told"           1.76
    "said"            1.68
    " mengatakan"     1.62
    " explained"      1.62
    "says"            1.57
    " Says"           1.56
    " stated"         1.54
    " commented"      1.50
    Activation Density: 0.487%

    No Known Activations