INDEX
    Explanations

    assertions or beliefs about people's actions or characteristics

    New Auto-Interp
    Negative Logits
    therefore
    -0.63
     daher
    -0.56
     therefore
    -0.55
    +#+
    -0.54
     todėl
    -0.53
     tuttavia
    -0.52
     übrigens
    -0.51
     hence
    -0.51
     Therefore
    -0.50
    however
    -0.50
    POSITIVE LOGITS
    таратура
    0.72
     mergeFrom
    0.70
    modelBuilder
    0.69
    '}>
    0.67
    "]];
    0.66
    "}>
    0.65
    __":
    
    0.65
    ."));
    0.65
    ])):
    0.64
     المعيارى
    0.64
    Act Density 0.361%

    No Known Activations